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Sir: 



BOX DAC 

Assistant Commissioner for Patents 
Washington, D.C. 20231 




The present petition is being filed by the Applicants to reque^fiat the Patent and 
Trademark Office (PTO) provide an appropriate serial number to a PCT application 
(Applicant docket number LEX-0118-PCT) filed by Applicants on January 25, 2001. The 
present petition is deemed necessary because the PTO has apparently accorded the above- 
identified PCT application with a U.S. Patent Application Serial Number. While considering 
the present petition, please consider the following summary of the relevant facts: 

On January 25, 2001, Applicants mailed a package to the PTO that contained two 
separate patent applications. The first application was the presently described PCT 
application and, as evidenced by the complete copy of the filed materials-- Exhibit A, was 
clearly directed to "Box PCT." The second application was a U.S. nonprovisional application 
(that generally corresponded to the U.S. version of the PCT application, a complete copy of 
the U.S. application as filed is provided as Exhibit B) that was clearly directed to "Box Patent 
Application." On February 8, 2001, Applicants received a stamped received postcard 
verifying the PTO's receipt of the PCT application (Exhibit C), and a separate stamped 
received postcard for U.S. nonprovisional patent application (Exhibit D). Upon inspection of 
the postcards, it became apparent to Applicants' representative that the presently described 
PCT application (Atty. docket number LEX-0118-PCT) had been erroneously accorded a 



U.S. Serial Number (09/769,952). In fact, the same serial number was apparently accorded 
to the corresponding U.S. nonprovisional application (Atty docket number LEX-0118-USA) 
and the PCT application at issue. 

As shown in the stamped received postcard for the PCT application (Exhibit D), the 
postcard was clearly marked Box PCT [bold in original] and, when filed, the postcard had 
been attached to a complete PCT Application filed in conjunction with a PCT Request, PCT 
(Annex - Fee Calculation Sheet), and a PCT Power of Attorney (Exhibit A). There were also 
two computer diskettes submitted. One of these diskettes was clearly marked both on the 
disk label and the disk mailer as "PCT Easy" along with the attorney docket number (LEX- 
01 18-PCT). The second diskette was clearly marked both on the disk label and disk mailer 
as "Sequence Listing" along with the attorney docket number (LEX-01 18-PCT). After 
contacting the PCT Help Desk, Applicants' representative was able to confirm that the PTO 
had apparently placed the same U.S. Patent Application Serial Number on the presently 
described PCT application and the related U.S. nonprovisional application that had been 
mailed along with the PCT application at issue. From what Applicants' representative and the 
PCT help desk could surmise from the PTO files, it appears that the PTO had apparently 
erroneously assumed that the Utility Application Transmittal sheet from the U.S. nonprovisional 
application somehow related to both the U.S. application and the separate PCT application 
and mistakenly combined both the presently described PCT application and the U.S. 
nonprovisional application. In fact, Applicants have been informed that both the PCT 
application at issue and the U.S. nonprovisional application have been mistakenly placed in the 
same file (for the U.S. application) at the PTO. Applicants also note that, according to the 
PCT Help Desk, the computer diskettes received by the PTO are apparently missing from the 
file. Given the rather unusual nature of the present predicament, the PCT Help Desk has also 
informed Applicants' representative that the present situation can only be rectified by 
Applicants filing the present petition. 

Accordingly, the Applicants hereby petition the PTO to remove the presently 
described PCT application from the U.S. application file, forward the complete PCT 
Application identifiable by Atty Docket Number LEX-01 18-PCT to the appropriate office 
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(Box PCT), and accord the above identified PCT Application with an International Application 
Number with the filing date of January 25, 2001. 

The Commissioner is authorized to charge the requisite Petition Fee under 37 C.F.R. 
section 1 . 1 7(h) to Deposit Account No. 50-0892, or for any matter in connection with this 
Petition, which may be required. 

Respectfully submitted, 




Date Lance K. Ishimoto Reg. No. 41,866 



LEXICON GENETICS INCORPORATED 
4000 Research Forest Drive 
The Woodlands, TX 77381 
(281) 863-3333 
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NOVEL HUMAN ENZYMES AND 
POLYNUCLEOTIDES ENCODING THE SAME 

The present application claims the benefit of U.S. 
Provisional Application Number 60/179,000 which was filed on 
January 28, 2000 and is herein incorporated by reference in its 
entirety . 

1. INTRODUCTION 
The -present invention relates to the discovery, 
identification, and characterization of novel human 

polynucleotides encoding proteins sharing sequence similarity with 
mammalian enzymes. The invention encompasses the described 
polynucleotides, host cell expression systems, the encoded 
protein, fusion proteins, polypeptides and peptides, antibodies to 
the encoded proteins and peptides, and sequencet ically engineered 
animals that either lack or over express the disclosed sequences, 
antagonists and agonists of the proteins, and other compounds that 
modulate the expression or activity of the proteins encoded by the 
disclosed polynucleotides that can be used for diagnosis, drug 
screening, clinical trial monitoring and the treatment of 
physiological disorders . 

2. BACKGROUND OF THE INVENTION 
Enzymes are biological catalysts that modify biological 
substrates, including proteins, as part of degradation, 
maturation, catabolic, metabolic, differentiation, and secretory 
pathways within the body. Enzyme abnormalities have thus been 
associated with, inter alia, growth, development, protein and 
cellular senescence, cancer, or other diseases. 

3. SUMMARY OF THE INVENTION 
The present invention relates to the discovery, 
identification, and characterization of nucleotides that encode 
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novel human proteins, and the corresponding amino acid sequences 
of these proteins. The novel human proteins (NHPs) described for 
the first time herein share structural similarity with nitrilase 
proteins from a wide variety of living organisms. 
5 The novel human nucleic acid (cDNA) sequences described 

herein, encode proteins/open reading frames (ORFs) of 276, 159, 
121, 168, 130, 152, and 285 amino acids in length (see 
respectively SEQ ID NOS: 2, 4, 6, 8, 10, 12 and 14) . 

The invention also encompasses agonists and antagonists of 
10 the described NHPs including small molecules, large molecules, 
mutant bJHPs, or portions thereof that compete with native NHPs, 
NHP peptides, and antibodies, as well as nucleotide sequences that 
can be used to inhibit the expression of the described NHPs (e.g., 
antisense and ribozyme molecules, and gene or regulatory sequence 
15 replacement constructs) or to enhance the expression of the 
described NHPs (e.g., expression constructs that place the 
described sequence under the control of a strong promoter system) , 
and transgenic animals that express a NHP transgene, or "knock- 
outs" (which can be conditional) that do not express a functional 
20 NHP. 

Further, the present invention also relates to processes for 
identifying compounds that modulate, i.e., act as agonists or 
antagonists, of NHP expression and/or NHP activity that utilize 
purified preparations of the described NHP and/or NHP product, or 
25 cells expressing the same. Such compounds can be used as 

therapeutic agents for the treatment of any of a wide variety of 
symptoms associated with biological disorders or imbalances. 

4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
30 The Sequence Listing provides the sequences of the NHP ORFs 

encoding the described NHP amino acid sequences. SEQ ID NOS: 15 
describe representative a nit rilase-like ORF with flanking 
sequences . 
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5. DETAILED DESCRIPTION OF THE INVENTION 
The NHPs, described for the first time herein, are novel 
proteins that are expressed in, inter alia, human cell lines, gene 
trapped cells and human brain, fetal brain, pituitary, cerebellum, 
5 spinal cord, thymus, spleen, lymph node, bone marrow, trachea, 
lung, kidney, fetal liver, liver, prostate, testis, thyroid, 
adrenal gland, pancreas, salivary gland, stomach, small intestine, 
colon, skeletal muscle, heart, placenta, mammary gland, adipose, 
skin, esophagus, bladder, pericardium, hypothalamus, ovary, fetal 
10 kidney, and fetal lung cells. 

The described sequences were compiled from gene trapped cDNAs 
and clones isolated from human prostate, lymph node, pituitary, 
mammary gland, and kidney cDNA library (Edge Biosystems, 
Gaithersburg, MD) . The present invention encompasses the 
15 nucleotides presented in the Sequence Listing, host cells 

expressing such nucleotides, the expression products of such 
nucleotides, and: (a) nucleotides that encode mammalian homologs 
of the described sequences, including the specifically described 
NHPs, and the NHP products; (b) nucleotides that encode one or 
20 more portions of a NHP that correspond to functional domains of 

the NHP, and the polypeptide products specified by such nucleotide 
sequences, including but not limited to the novel regions of any 
active domain (s); (c) isolated nucleotides that encode mutant 
versions, engineered or naturally occurring, of a described NHP in 
25 which all or a part of at least one domain is deleted or altered, 
and the polypeptide products specified by such nucleotide 
sequences, including but not limited to soluble proteins and 
peptides in which all or a portion of the signal sequence is 
deleted; (d) nucleotides that encode chimeric fusion proteins 
30 containing all or a portion of a coding region of a NHP, or one of 
its domains (e.g., a receptor or ligand binding domain, accessory 
protein/self-association domain, etc.) fused to another peptide or 
polypeptide; or (e) therapeutic or diagnostic derivatives of the 
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described polynucleotides such as oligonucleotides, antisense 
polynucleotides, ribozymes, dsRNA/ or gene therapy constructs 
comprising a sequence first disclosed in the Sequence Listing. 

As discussed above, the present invention includes: (a) the 
5 human DNA sequences presented in the Sequence Listing (and vectors 
comprising the same) and additionally contemplates any nucleotide 
sequence encoding a contiguous NHP open reading frame (ORF) , or a 
contiguous exon splice junction first described in the Sequence 
Listing, that hybridizes to a complement of a DNA sequence 
10 presented in the Sequence Listing under highly stringent 

conditions, e.g., hybridization to filter-bound DNA in 0.5 M 
NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and 
washing in 0.1xSSC/0.1% SDS at 68°C (Ausubel F.M. et al., eds . , 
1989, Current Protocols in Molecular Biology, Vol. I, Green 
15 Publishing Associates, Inc., and John Wiley & sons, Inc., New 
York, at p. 2.10.3) and encodes a functionally equivalent gene 
product. Additionally contemplated are any nucleotide sequences 
that hybridize to the complement of the DNA sequence that encode 
and express an amino acid sequence presented in the Sequence 
20 Listing under moderately stringent conditions, e.g., washing in 
0.2xSSC/0.1% SDS at 42°C (Ausubel et al., 1989, supra), yet still 
encode a functionally equivalent NHP product. Functional 
equivalents of a NHP include naturally occurring NHPs present in 
other species and mutant NHPs whether naturally occurring or 
25 engineered (by site directed mutagenesis, gene shuffling, directed 
evolution as described in, for example, U.S. Patent No. 
5,837,458). The invention also includes degenerate nucleic acid 
variants of the disclosed NHP polynucleotide sequences. 

Additionally contemplated are polynucleotides encoding a NHP 
30 ORF, or its functional equivalent, encoded by a polynucleotide 

sequence that is about 99, 95, 90, or about 85 percent similar or 
identical to corresponding regions of the nucleotide sequences of 
the Sequence Listing (as measured by BLAST sequence comparison 
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analysis using, for example, the GCG sequence analysis package 
using standard default settings) . 

The invention also includes nucleic acid molecules, 
preferably DNA molecules, that hybridize to, and are therefore the 

5 complements of, the described NHP nucleotide sequences. Such 
hybridization conditions may be highly stringent or less highly 
stringent, as described above. In instances where the nucleic 
acid molecules are deoxyoligonucleot ides ("DNA oligos"), such 
molecules are generally about 16 to about 100 bases long, or about 
10 20 to about 80, or about 34 to about 45 bases long, or any 
variation or combination of sizes represented therein that 
incorporate a contiguous region of sequence first disclosed in the 
Sequence Listing. Such oligonucleotides can be used in 
conjunction with the polymerase chain reaction (PCR) to screen 

15 libraries, isolate clones, and prepare cloning and sequencing 

templates, etc. 

Alternatively, such NHP oligonucleotides can be used as 
hybridization probes for screening libraries, and assessing gene 
expression patterns (particularly using a micro array or high- 
20 throughput "chip" format) . Additionally, a series of the 

described NHP oligonucleotide sequences, or the complements 
thereof, can be used to represent all or a portion of the 
described NHP sequences. An oligonucleotide or polynucleotide 
sequence first disclosed in at least a portion of one or more of 
25 the sequences of SEQ ID NOS : 1-15 can be used as a hybridization 
probe in conjunction with a solid support matrix/substrate 
(resins, beads, membranes, plastics, polymers, metal or metallized 
substrates, crystalline or polycrystalline substrates, etc.). Of 
particular note are spatially addressable arrays (i.e., gene 
30 chips, microtiter plates, etc.) of oligonucleotides and 

polynucleotides, or corresponding oligopeptides and polypeptides, 
wherein at least one of the biopolymers present on the spatially 
addressable array comprises an oligonucleotide or polynucleotide 
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sequence first disclosed in at least one of the sequences of SEQ 
ID NOS: 1-15, or an amino acid sequence encoded thereby. Methods 
for attaching biopolymers to, or synthesizing biopolymers on, 
solid support matrices, and conducting binding studies thereon are 
5 disclosed in, inter alia, U.S. Patent Nos. 5,700,637, 5,556,752, 
5,744,305, 4,631,211, 5,445,934, 5,252,743, 4,713,326, 5,424,186, 
and 4,689,405 the disclosures of which are herein incorporated by 
reference in their entirety. 

Addressable arrays comprising sequences first disclosed in 
10 SEQ ID NOS: 1-15 can be used to identify and characterize the 
temporal and tissue specific expression of a gene. These 
addressable arrays incorporate oligonucleotide sequences of 
sufficient length to confer the required specificity, yet be 
within the limitations of the production technology. The length 
15 of these probes is within a range of between about 8 to about 2000 
nucleotides. Preferably the probes consist of 60 nucleotides and 
more preferably 25 nucleotides from the sequences first disclosed 
in SEQ ID NOS: 1-15 . 

For example, a series of the described oligonucleotide 
20 sequences, or the complements thereof, can be used in chip format 
to represent all or a portion of the described sequences. The 
oligonucleotides, typically between about 16 to about 40 (or any 
whole number within the stated range) nucleotides in length can 
partially overlap each other and/or the sequence may be 
25 represented using oligonucleotides that do not overlap. 

Accordingly, the described polynucleotide sequences shall 
typically comprise at least about two or three distinct 
oligonucleotide sequences of at least about 8 nucleotides in 
length that are each first disclosed in the described Sequence 
30 Listing. Such oligonucleotide sequences can begin at any 

nucleotide present within a sequence in the Sequence Listing and 
proceed in either a sense (5'-to-3') orientation vis-a-vis the 
described sequence or in an antisense orientation. 
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Microarray-based analysis allows the discovery of broad 
patterns of genetic activity, providing new understanding of gene 
functions and generating novel and unexpected insight into 
transcriptional processes and biological mechanisms. The use of 
5 addressable arrays comprising sequences first disclosed in SEQ ID 
N0S:1-15 provides detailed information about transcriptional 
changes involved in a specific pathway, potentially leading to the 
identification of novel components or gene functions that manifest 
themselves as novel phenotypes. 
10 Probes consisting of sequences first disclosed in SEQ ID 

NOS:l-15 can also be used in the identification, selection and 
validation of novel molecular targets for drug discovery. The use 
of these unique sequences permits the direct confirmation of drug 
targets and recognition of drug dependent changes in gene 
15 expression that are modulated through pathways distinct from the 

drugs intended target. These unique sequences therefore also have 
utility in defining and monitoring both drug action and toxicity. 

As an example of utility, the sequences first disclosed in 
SEQ ID N0S:1-15 can be utilized in microarrays or other assay 
20 formats, to screen collections of genetic material from patients 
who have a particular medical condition. These investigations can 
also be carried out using the sequences first disclosed in SEQ ID 
NOS:l-15 in silico and by comparing previously collected genetic 
databases and the disclosed sequences using computer software 
25 known to those in the art. 

Thus the sequences first disclosed in SEQ ID NOS:l-15 can be 
used to identify mutations associated with a particular disease 
and also as a diagnostic or prognostic assay. 

Although the presently described sequences have been 
30 specifically described using nucleotide sequence, it should be 

appreciated that each of the sequences can uniquely be described 
using any of a wide variety of additional structural attributes, 
or combinations thereof. For example, a given sequence can be 
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described by the net composition of the nucleotides present within 
a given region of the sequence in conjunction with the presence of 
one or more specific oligonucleotide sequence (s) first disclosed 
in the SEQ ID NOS : 1-15. Alternatively, a restriction map 
specifying the relative positions of restriction endonuclease 
digestion sites, or various palindromic or other specific 
oligonucleotide sequences can be used to structurally describe a 
given sequence. Such restriction maps, which are typically 
generated by widely available computer programs (e.g., the 
University of Wisconsin GCG sequence analysis package, SEQUENCHER 
3.0, Gene Codes Corp., Ann Arbor, MI, etc.), can optionally be 
used in conjunction with one or more discrete nucleotide 
sequence (s) present in the sequence that can be described by the 
relative position of the sequence relatve to one or more 
additional sequence (s) or one or more restriction sites present in 
the disclosed sequence. 

For oligonucleotide probes, highly stringent conditions may 
refer, for example, to washing in 6xSSC/0.05% sodium pyrophosphate 
at 37°C (for 14-base oligos), 48°C (for 17-base oligos), 55°C (for 
20-base oligos), and 60°C (for 23-base oligos). These nucleic 
acid molecules may encode or act as NHP gene antisense molecules, 
useful, for example, in NHP gene regulation (for and/or as 
antisense primers in amplification reactions of NHP nucleic acid 
sequences) . With respect to NHP gene regulation, such techniques 
can be used to regulate biological functions. Further, such 
sequences may be used as part of ribozyme and/or triple helix 
sequences that are also useful for NHP gene regulation. 

Inhibitory antisense or double stranded oligonucleotides can 
additionally comprise at least one modified base moiety which is 
selected from the group including but not limited to 
5-f luorouracil , 5-bromouracil , 5-chlorouracil , 5-iodouracil , 
hypoxanthine, xantine, 4-acetylcytosine , 5- ( carboxyhydroxylmethyl ) 
uracil , 5-carboxymethylaminomethyl-2-thiouridine , 
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5-carboxymethylaminomethyluracil, dihydrouracil , beta-D- 
galactosylqueosine, inosine, N 6- isopentenyl adenine, 

1- methylguanine, 1-methylinosine, 2 , 2-dimethylguanine , 

2- methyladenine, 2-methylguanine, 3-methylcytosine , 
5 5-methylcytosine, N6-adenine, 7-methylguanine , 

5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil , 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil , 
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine , uracil-5- 
oxyacetic acid (v) , wybutoxosine , pseudouracil, queosine, 

10 2-thiocytosine, 5-methyl-2-thiouracil , 2-thiouracil , 4-thiouracil, 
5-methyluracil, uracil-5-oxyacet ic acid methylester , uracil- 
5-oxyacetic acid (v) , 5-methyl-2-thiouracil / 3- ( 3-amino-3-N-2- 
carboxypropyl) uracil, (acp3)w, and 2 , 6-diaminopur ine . 

The antisense oligonucleotide can also comprise at least one 

15 modified sugar moiety selected from the group including but not 
limited to arabinose, 2-f luoroarabinose , xylulose, and hexose. 

In yet another embodiment, the antisense oligonucleotide will 
comprise at least one modified phosphate backbone selected from 
the group consisting of a phosphorothioate , a phosphorodithioate , 

20 a phosphoramidothioate, a phosphoramidate , a phosphordiamidate , a 
methylphosphonate, an alkyl phosphotr iester , and a formacetal or 
analog thereof. 

In yet another embodiment, the antisense oligonucleotide is 
an a-anomeric oligonucleotide. An a-anomeric oligonucleotide 

25 forms specific double-stranded hybrids with complementary RNA in 
which, contrary to the usual p-units, the strands run parallel to 
each other (Gautier et al., 1987, Nucl . Acids Res. 15:6625-6641). 
The oligonucleotide is a 2'-0-methylribonucleotide (Inoue et al., 
1987, Nucl. Acids Res. 25:6131-6148), or a chimeric RNA-DNA 

30 analogue (Inoue et al. f 1987, FEBS Lett. 215:327-330). 

Alternatively, double stranded RNA can be used to disrupt the 
expression and function of a targeted NHP. 
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Oligonucleotides of the invention can be synthesized by 
standard methods known in the art, e.g. by use of an automated DNA 
synthesizer (such as are commercially available from Biosearch, 
Applied Biosystems, etc.). As examples, phosphorothioate 
5 oligonucleotides can be synthesized by the method of Stein et al. 
(1988, Nucl. Acids Res. 25:3209), and methylphosphonate 
oligonucleotides can be prepared by use of controlled pore glass 
polymer supports (Sarin et al., 1988, Proc. Natl. Acad. Sci. 
U.S.A. SS:7448-7451) , etc. 
10 Low stringency conditions are well known to those of skill in 

the art, and will vary predictably depending on the specific 
organisms from which the library and the labeled sequences are 
derived. For guidance regarding such conditions see, for example, 
Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual (and 
15 periodic updates thereof), Cold Springs Harbor Press, N.Y.; and 
Ausubel et al., 1989, Current Protocols in Molecular Biology, 
Green Publishing Associates and Wiley Interscience , N.Y. 

Alternatively, suitably labeled NHP nucleotide probes can be 
used to screen a human genomic library using appropriately 
20 stringent conditions or by PCR. The identification and 
characterization of human genomic clones is helpful for 
identifying polymorphisms (including, but not limited to, 
nucleotide repeats, microsatellite alleles, single nucleotide 
polymorphisms, or coding single nucleotide polymorphisms) , 
25 determining the genomic structure of a given locus/allele, and 

designing diagnostic tests. For example, sequences derived from 
regions adjacent to the intron/exon boundaries of the human gene 
can be used to design primers for use in amplification assays to 
detect mutations within the exons, introns, splice sites (e.g., 
30 splice acceptor and/or donor sites), etc., that can be used in 
diagnostics and pharmacogenomics. 

Further, a NHP homolog can be isolated from nucleic acid from 
an organism of interest by performing PCR using two degenerate or 
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"wobble" oligonucleotide primer pools designed on the basis of 
amino acid sequences within the NHP products disclosed herein. 
The template for the reaction may be total RNA, mRNA, and/or cDNA 
obtained by reverse transcription of mRNA prepared from human or 
5 non-human cell lines or tissue known or suspected to express an 
allele of a NHP gene. 

The PCR product can be subcloned and sequenced to ensure that 
the amplified sequences represent the sequence of the desired NHP 
sequence. The PCR fragment can then be used to isolate a full 

10 length cDNA clone by a variety of methods. For example, the 
amplified fragment can be labeled and used to screen a cDNA 
library, such as a bacteriophage cDNA library. Alternatively, the 
labeled fragment can be used to isolate genomic clones via the 
screening of a genomic library. 

15 PCR technology can also be used to isolate full length cDNA 

sequences. For example, RNA can be isolated, following standard 
procedures, from an appropriate cellular or tissue source (i.e., 
one known, or suspected, to express a NHP sequence, such as, for 
example, testis tissue) . A reverse transcription (RT) reaction 

20 can be performed on the RNA using an oligonucleotide primer 

specific for the most 5 1 end of the amplified fragment for the 
priming of first strand synthesis. The resulting RNA/DNA hybrid 
may then be "tailed" using a standard terminal transferase 
reaction, the hybrid may be digested with RNase H, and second 

25 strand synthesis may then be primed with a complementary primer. 
Thus, cDNA sequences upstream of the amplified fragment can be 
isolated. For a review of cloning strategies that can be used, 
see e.g., Sambrook et al., 1989, supra. 

A cDNA encoding a mutant NHP sequence can be isolated, for 

30 example, by using PCR. In this case, the first cDNA strand may be 
synthesized by hybridizing an oligo-dT oligonucleotide to mRNA 
isolated from tissue known or suspected to be expressed in an 
individual putatively carrying a mutant NHP allele, and by 
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extending the new strand with reverse transcriptase. The second 
strand of the cDNA is then synthesized using an oligonucleotide 
that hybridizes specifically to the 5 1 end of the normal gene. 
Using these two primers, the product is then amplified via PCR, 
optionally cloned into a suitable vector, and subjected to DNA 
sequence analysis through methods well known to those of skill in 
the art. By comparing the DNA sequence of the mutant NHP allele 
to that of a corresponding normal NHP allele, the mutation (s) 
responsible for the loss or alteration of function of the mutant 
NHP gene product can be ascertained. 

Alternatively, a genomic library can be constructed using DNA 
obtained from an individual suspected of or known to carry a 
mutant NHP allele (e.g., a person manifesting a NHP-associated 
phenotype such as, for example, obesity, high blood pressure, 
connective tissue disorders, infertility, etc.), or a cDNA library 
can be constructed using RNA from a tissue known, or suspected, to 
express a mutant NHP allele. A normal NHP sequence, or any 
suitable fragment thereof, can then be labeled and used as a probe 
to identify the corresponding mutant NHP allele in such libraries. 
Clones containing mutant NHP sequences can then be purified and 
subjected to sequence analysis according to methods well known to 
those skilled in the art. 

Additionally, an expression library can be constructed 
utilizing cDNA synthesized from, for example, RNA isolated from a 
tissue known, or suspected, to express a mutant NHP allele in an 
individual suspected of or known to carry such a mutant allele. 
In this manner, gene products made by the putatively mutant tissue 
can be expressed and screened using standard antibody screening 
techniques in conjunction with antibodies raised against normal 
NHP product, as described below. (For screening techniques, see, 
for example, Harlow, E. and Lane, eds . , 1988, "Antibodies: A 
Laboratory Manual", Cold Spring Harbor Press, Cold Spring Harbor.) 
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Additionally, screening can be accomplished by screening with 
labeled NHP fusion proteins, such as, for example, alkaline 
phosphatase-NHP or NHP-alkaline phosphatase fusion proteins. 
In cases where a NHP mutation results in an expressed gene product 
5 with altered function (e.g., as a result of a missense or a 

frameshift mutation) , polyclonal antibodies to NHP are likely to 
cross-react with a corresponding mutant NHP gene product. Library 
clones detected via their reaction with such labeled antibodies 
can be purified and subjected to sequence analysis according to 

10 methods well known in the art. 

The invention also encompasses (a) DNA vectors that contain 
any of the foregoing NHP coding sequences and/or their complements 
(i.e., antisense) ; (b) DNA expression vectors that contain any of 
the foregoing NHP coding sequences operatively associated with a 

15 regulatory element that directs the expression of the coding 

sequences (for example, baculo virus as described in U.S. Patent 
No. 5,869,336 herein incorporated by reference); (c) genetically 
engineered host cells that contain any of the foregoing NHP coding 
sequences operatively associated with a regulatory element that 

20 directs the expression of the coding sequences in the host cell; 
and (d) genetically engineered host cells that express an 
endogenous NHP sequence under the control of an exogenously 
introduced regulatory element (i.e., gene activation). As used 
herein, regulatory elements include, but are not limited to, 

25 inducible and non-inducible promoters, enhancers, operators and 
other elements known to those skilled in the art that drive and 
regulate expression. Such regulatory elements include but are not 
limited to the cytomegalovirus (hCMV) immediate early gene, 
regulatable, viral elements (particularly retroviral LTR 

30 promoters), the early or late promoters of SV40 adenovirus, the 
lac system, the trp system, the TAC system, the TRC system, the 
major operator and promoter regions of phage lambda, the control 
regions of fd coat protein, the promoter for 3-phosphoglycerate 
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kinase (PGK) , the promoters of acid phosphatase, and the promoters 
of the yeast a-mating factors. 

The present invention also encompasses antibodies and anti- 
idiotype antibodies (including Fab fragments), antagonists and 
5 agonists of a NHP, as well as compounds or nucleotide constructs 
that inhibit expression of a NHP gene (transcription factor 
inhibitors, antisense and ribozyme molecules, or gene or 
regulatory sequence replacement constructs) , or promote the 
expression of a NHP (e.g., expression constructs in which NHP 
10 coding sequences are operatively associated with expression 

control elements such as promoters, promoter /enhancers , etc.). 

The NHPs or NHP peptides, NHP fusion proteins, NHP nucleotide 
sequences, antibodies, antagonists and agonists can be useful for 
the detection of mutant NHPs or inappropriately expressed NHPs for 
15 the diagnosis of disease. The NHP proteins or peptides, NHP 

fusion proteins, NHP nucleotide sequences, host cell expression 
systems, antibodies, antagonists, agonists and genetically 
engineered cells and animals can be used for screening for drugs 
(or high throughput screening of combinatorial libraries) 
20 effective in the treatment of the symptomatic or phenotypic 

manifestations of perturbing the normal function of a NHP in the 
body. The use of engineered host cells and/or animals may offer 
an advantage in that such systems allow not only for the 
identification of compounds that bind to the endogenous receptor 
25 for a NHP, but can also identify compounds that trigger NHP- 
mediated activities or pathways. 

Finally, the NHP products can be used as therapeutics. For 
example, soluble derivatives such as NHP peptides/domains 
corresponding to NHP, NHP fusion protein products (especially NHP- 
30 Ig fusion proteins, i.e., fusions of a NHP, or a domain of a NHP, 
to an IgFc) , NHP antibodies and anti-idiotypic antibodies 
(including Fab fragments), antagonists or agonists (including 
compounds that modulate or act on downstream targets in a NHP- 
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mediated pathway) can be used to directly treat diseases or 
disorders. For instance, the administration of an effective 
amount of soluble NHP, or a NHP-IgFc fusion protein or an anti- 
idiotype antibody (or its Fab) that mimics the NHP could activate 
5 or effectively antagonize the endogenous NHP receptor. Nucleotide 
constructs encoding such NHP products can be used to genetically 
engineer host cells to express such products in vivo; these 
genetically engineered cells function as "bioreactors" in the body 
delivering a continuous supply of a NHP, a NHP peptide, or a NHP 

10 fusion protein to the body. Nucleotide constructs encoding 

functional NHP, mutant NHPs, as well as antisense and ribozyme 
molecules can also be used in "gene therapy" approaches for the 
modulation of NHP expression. Thus, the invention also 
encompasses pharmaceutical formulations and methods for treating 

15 biological disorders. 

Various aspects of the invention are described in greater 
detail in the subsections below. 

5.1 THE NHP SEQUENCES 

20 The cDNA sequences and the corresponding deduced amino acid 

sequences of the described NHP are presented in the Sequence 
Listing. SEQ ID NOS:15 describes the NHP ORFs as well as flanking 
regions. The NHP nucleotides were obtained from human cDNA 
libraries using probes and/or primers generated from human gene 

25 trapped sequence tags. Expression analysis has provided evidence 
that the described NHP can be expressed in a variety of human 
cells as well as gene trapped human cells. 

5.2 NHP AND NHP POLYPEPTIDES 
30 NHPs, polypeptides, peptide fragments, mutated, truncated, or 

deleted forms of the NHPs, and/or NHP fusion proteins can be 
prepared for a variety of uses. These uses include, but are not 
limited to, the generation of antibodies, as reagents in 
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diagnostic assays, for the identification of other cellular gene 
products related to a NHP, as reagents in assays for screening for 
compounds that can be as pharmaceutical reagents useful in the 
therapeutic treatment of mental, biological, or medical disorders 
5 and disease. 

The Sequence Listing discloses the amino acid sequence 
encoded by the described NHP polynucleotides. The NHPs display 
initiator methionines in DNA sequence contexts consistent with 
translation initiation sites, and apparently display signal 

10 sequences which can indicate that the described NHP ORFs are 
secreted proteins or possibly membrane associated. 

The NHP amino acid sequences of the invention include the 
amino acid sequences presented in the Sequence Listing as well as 
analogues and derivatives thereof, as well as any oligopeptide 

15 sequence of at least about 10-40, generally about 12-35, or about 
16-30 amino acids in length first disclosed in the Sequence 
Listing. Further, corresponding NHP homologues from other species 
are encompassed by the invention. In fact, any NHP encoded by the 
NHP nucleotide sequences described above are within the scope of 

20 the invention, as are any novel polynucleotide sequences encoding 
all or any novel portion of an amino acid sequence presented in 
the Sequence Listing. The degenerate nature of the genetic code 
is well known, and, accordingly, each amino acid presented in the 
Sequence Listing, is generically representative of the well known 

25 nucleic acid "triplet'' codon, or in many cases codons, that can 

encode the amino acid. As such, as contemplated herein, the amino 
acid sequences presented in the Sequence Listing, when taken 
together with the genetic code (see, for example, Table 4-1 at 
page 109 of "Molecular Cell Biology", 1986, J. Darnell et al. 

30 eds., Scientific American Books, New York, NY, herein incorporated 
by reference) are generically representative of all the various 
permutations and combinations of nucleic acid sequences that can 
encode such amino acid sequences. 
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The invention also encompasses proteins that are functionally 
equivalent to the NHPs encoded by the presently described 
nucleotide sequences as judged by any of a number of criteria, 
including, but not limited to, the ability to bind and cleave a 
5 substrate of a NHP, or the ability to effect an identical or 
complementary downstream pathway, or a change in cellular 
metabolism (e.g., proteolytic activity, ion flux, tyrosine 
phosphorylation, etc.). Such functionally equivalent NHP proteins 
include, but are not limited to, additions or substitutions of 

10 amino acid residues within the amino acid sequence encoded by the 
NHP nucleotide sequences described above, but which result in a 
silent change, thus producing a functionally equivalent gene 
product. Amino acid substitutions can be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity , 

15 hydrophilicity, and/or the amphipathic nature of the residues 

involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, and 

20 glutamine; positively charged (basic) amino acids include 

arginine, lysine, and histidine; and negatively charged (acidic) 
amino acids include aspartic acid and glutamic acid. 

A variety of host-expression vector systems can be used to 
express the NHP nucleotide sequences of the invention. Where, as 

25 in the present instance, the NHP products or NHP polypeptides can 
be produced in soluble or secreted forms (by removing one or more 
transmembrane domains where applicable) , the peptide or 
polypeptide can be recovered from the culture media. Such 
expression systems also encompass engineered host cells that 

30 express a NHP, or a functional equivalent, in situ. Purification 
or enrichment of NHP from such expression systems can be 
accomplished using appropriate detergents and lipid micelles and 
methods well known to those skilled in the art. However, such 
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engineered host cells themselves may be used in situations where 
it is important not only to retain the structural and functional 
characteristics of the NHP, but to assess biological activity, 
e.g. , in drug screening assays. 
5 The expression systems that may be used for purposes of the 

invention include but are not limited to microorganisms such as 
bacteria (e.g., E. coli, B. subtilis) transformed with recombinant 
bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors 
containing NHP nucleotide sequences; yeast (e.g., Saccharomyces , 

10 Pichia) transformed with recombinant yeast expression vectors 

containing NHP encoding nucleotide sequences; insect cell systems 
infected with recombinant virus expression vectors (e.g., 
baculovirus) containing NHP sequences; plant cell systems infected 
with recombinant virus expression vectors (e.g., cauliflower 

15 mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with 
recombinant plasmid expression vectors (e.g., Ti plasmid) 
containing NHP nucleotide sequences; or mammalian cell systems 
(e.g., COS, CHO, BHK, 293, 3T3) harboring recombinant expression 
constructs containing promoters derived from the genome of 

20 mammalian cells (e.g., metallothionein promoter) or from mammalian 
viruses (e.g., the adenovirus late promoter; the vaccinia virus • 
7 . 5K promoter ) . 

In bacterial systems, a number of expression vectors may be 
advantageously selected depending upon the use intended for the 

25 NHP product being expressed. For example, when a large quantity 
of such a protein is to be produced for the generation of 
pharmaceutical compositions of or containing NHP, or for raising 
antibodies to a NHP, vectors that direct the expression of high 
levels of fusion protein products that are readily purified may be 

30 desirable. Such vectors include, but are not limited, to the E. 
coli expression vector pUR278 (Ruther et al., 1983, EMBO J. 
2:1791), in which a NHP coding sequence may be ligated 
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individually into the vector in frame with the lacZ coding region 
so that a fusion protein is produced; pIN vectors (Inouye & 
Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van Heeke & 
Schuster, 1989, J. Biol. Chem. 254:5503-5509); and the like. pGEX 
5 vectors (Pharmacia or American Type Culture Collection) can also 
be used to express foreign polypeptides as fusion proteins with 
glutathione S-transf erase (GST) . In general, such fusion proteins 
are soluble and can easily be purified from lysed cells by 
adsorption to glutathione-agarose beads followed by elution in the 

10 presence of free glutathione. The PGEX vectors are designed to 
include thrombin or factor Xa protease cleavage sites so that the 
cloned target gene product can be released from the GST moiety. 

In an insect system, Autographs californica nuclear 
polyhidrosis virus (AcNPV) is used as a vector to express foreign 

15 sequences. The virus grows in Spodoptera frugiperda cells. A NHP 
coding sequence can be cloned individually into non-essential 
regions (for example the polyhedrin gene) of the virus and placed 
under control of an AcNPV promoter (for example the polyhedrin 
promoter) . Successful insertion of NHP coding sequence will 

20 result in inactivation of the polyhedrin gene and production of 
non-occluded recombinant virus (i.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene) . These 
recombinant viruses are then used to infect Spodoptera frugiperda 
cells in which the inserted sequence is expressed (e.g., see 

25 Smith et al. f 1983, J. Virol. 46: 584; Smith, U.S. Patent No. 
4,215,051) . 

In mammalian host cells, a number of viral-based expression 
systems may be utilized. In cases where an adenovirus is used as 
an expression vector, the NHP nucleotide sequence of interest may 
30 be ligated to an adenovirus transcription/translation control 

complex, e.g., the late promoter and tripartite leader sequence. 
This chimeric sequence may then be inserted in the adenovirus 
genome by in vitro or in vivo recombination. Insertion in a non- 
19 
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essential region of the viral genome (e.g., region El or E3) will 
result in a recombinant virus that is viable and capable of 
expressing a NHP product in infected hosts (e.g., See Logan & 
Shenk, 1984, Proc. Natl. Acad. Sci. USA 81 : 3655-3659) . Specific 
5 initiation signals may also be required for efficient translation 
of inserted NHP nucleotide sequences. These signals include the 
ATG initiation codon and adjacent sequences. In cases where an 
entire NHP sequence or cDNA, including its own initiation codon 
and adjacent sequences, is inserted into the appropriate 

10 expression vector, no additional translat ional control signals may 
be needed. However, in cases where only a portion of a NHP coding 
sequence is inserted, exogenous translational control signals, 
including, perhaps, the ATG initiation codon, must be provided. 
Furthermore, the initiation codon must be in phase with the 

15 reading frame of the desired coding sequence to ensure translation 
of the entire insert. These exogenous translational control 
signals and initiation codons can be of a variety of origins, both 
natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of appropriate transcription enhancer 

20 elements, transcription terminators, etc. (See Bittner et al., 
1987, Methods in Enzymol. 153:516-544). 

In addition, a host cell strain may be chosen that modulates 
the expression of the inserted sequences, or modifies and 
processes the gene product in the specific fashion desired. Such 

25 modifications (e.g., glycosylation) and processing (e.g., 

cleavage) of protein products may be important for the function of 
the protein. Different host cells have characteristic and 
specific mechanisms for the post-translational processing and 
modification of proteins and gene products. Appropriate cell 

30 lines or host systems can be chosen to ensure the correct 

modification and processing of the foreign protein expressed. To 
this end, eukaryotic host cells which possess the cellular 
machinery for proper processing of the primary transcript, 
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glycosylation, and phosphorylation of the gene product may be 
used. Such mammalian host cells include, but are not limited to, 
CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and in 
particular, human cell lines. 

For long-term, high-yield production of recombinant proteins, 
stable expression is preferred. For example, cell lines which 
stably express the NHP sequences described above can be 
engineered. Rather than using expression vectors which contain 
viral origins of replication, host cells can be transformed with 
DNA controlled by appropriate expression control elements (e.g., 
promoter, enhancer sequences, transcription terminators, 
polyadenylation sites, etc.), and a selectable marker. Following 
the introduction of the foreign DNA, engineered cells may be 
allowed to grow for 1-2 days in an enriched media, and then are 
switched to a selective media. The selectable marker in the 
recombinant plasmid confers resistance to the selection and allows 
cells to stably integrate the plasmid into their chromosomes and 
grow to form foci which in turn can be cloned and expanded into 
cell lines. This method may advantageously be used to engineer 
cell lines which express the NHP product. Such engineered cell 
lines may be particularly useful in screening and evaluation of 
compounds that affect the endogenous activity of the NHP product. 

A number of selection systems may be used, including but not 
limited to the herpes simplex virus thymidine kinase (Wigler, et 
al., 1911, Cell 11:223), hypoxanthine-guanine 

phosphoribosyltransf erase (Szybalska & Szybalski, 1962, Proc. 
Natl. , Acad. Sci. USA 48:2026), and adenine 

phosphoribosyltransf erase (Lowy, et al., 1980, Cell 22:817) genes 
can be employed in tk", hgprt" or aprt" cells, respectively. Also, 
antimetabolite resistance can be used as the basis of selection 
for the following genes: dhfr, which confers resistance to 
methotrexate (Wigler, et al., 1980, Natl. Acad. Sci. USA 77:3567; 
O'Hare, et al., 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, 
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which confers resistance to mycophenolic acid (Mulligan & Berg, 
1981, Proc. Natl. Acad. Sci . USA 78:2012); neo, which confers 
resistance to the aminoglycoside G-418 (Colberre-Garapin, et al., 
1981, J. Mol. Biol. 150:1); and hygro, which confers resistance to 
5 hygromycin (Santerre, et al., 1984, Gene 30:147). 

Alternatively, any fusion protein can be readily purified by 
utilizing an antibody specific for the fusion protein being 
expressed. For example, a system described by Janknecht et al. 
allows far the ready purification of non-denatured fusion proteins 

10 expressed in human cell lines (Janknecht, et al., 1991, Proc. 

Natl. Acad. Sci. USA 88:8972-8976). In this system, the gene of 
interest is subcloned into a vaccinia recombination plasmid such 
that the gene's open reading frame is translat ionally fused to an 
amino-terminal tag consisting of six histidine residues. Extracts 

15 from cells infected with recombinant vaccinia virus are loaded 

onto Ni 2+ • nitriloacet ic acid-agarose columns and his tidine- tagged 
proteins are selectively eluted with imidazole-containing buffers. 

Also encompassed by the present invention are fusion proteins 
that direct the NHP to a target organ and/or facilitate transport 

20 across the membrane into the cytosol. Conjugation of NHPs to 

antibody molecules or their Fab fragments could be used to target 
cells bearing a particular epitope. Attaching the appropriate 
signal sequence to the NHP would also transport the NHP to the 
desired location within the cell. Alternatively targeting of NHP 

25 or its nucleic acid sequence might be achieved using liposome or 
lipid complex based delivery systems. Such technologies are 
described in Liposomes: A Practical Approach , New, RRC ed., Oxford 
University Press, New York and in U.S. Patents Nos . 4,594,595, 
5,459,127, 5,948,767 and 6,110,490 and their respective 

30 disclosures which are herein incorporated by reference in their 
entirety. Additionally embodied are novel protein constructs 
engineered in such a way that they facilitate transport of the NHP 
to the target site or desired organ, where they cross the cell 
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membrane and/or the nucleus where the NHP can exert its functional 
activity. This goal may be achieved by coupling of the NHP to a 
cytokine or other ligand that provides targeting specificity, 
and/or to a protein transducing domain (see generally U.S. 
applications Ser. No. 60/111,701 and 60/056,713, both of which are 
herein incorporated by reference, for examples of such transducing 
sequences) to facilitate passage across cellular membranes and can 
optionally be engineered to include nuclear localization 
sequences . 

5.3 ANTIBODIES TO NHP PRODUCTS 
Antibodies that specifically recognize one or more epitopes 
of a NHP, or epitopes of conserved variants of a NHP, or peptide 
fragments of a NHP are also encompassed by the invention. Such 
antibodies include but are not limited to polyclonal antibodies, 
monoclonal antibodies (mAbs), humanized or chimeric antibodies, 
single chain antibodies, Fab fragments, F(ab') 2 fragments, 
fragments produced by a Fab expression library, ant i-idiotypic 
(anti-Id) antibodies, and epitope-binding fragments of any of the 
above . 

The antibodies of the invention may be used, for example, in 
the detection of NHP in a biological sample and may, therefore, be 
utilized as part of a diagnostic or prognostic technique whereby 
patients may be tested for abnormal amounts of NHP. Such 
antibodies may also be utilized in conjunction with, for example, 
compound screening schemes for the evaluation of the effect of 
test compounds on expression and/or activity of a NHP gene 
product. Additionally, such antibodies can be used in conjunction 
gene therapy to, for example, evaluate the normal and/or 
engineered NHP-expressing cells prior to their introduction into 
the patient. Such antibodies may additionally be used as a method 
for the inhibition of abnormal NHP activity. Thus, such 
antibodies may, therefore, be utilized as part of treatment 
methods . 
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For the production of antibodies, various host animals may be 
immunized by injection with the NHP, an NHP peptide (e.g., one 
corresponding to a functional domain of an NHP) , truncated NHP 
polypeptides (NHP in which one or more domains have been deleted) , 
5 functional equivalents of the NHP or mutated variant of the NHP. 
Such host animals may include but are not limited to pigs, 
rabbits, mice, goats, and rats, to name but a few. Various 
adjuvants may be used to increase the immunological response, 
depending on the host species, including but not limited to 

10 Freund's adjuvant (complete and incomplete), mineral salts such as 
aluminum hydroxide or aluminum phosphate, surface active 
substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, and potentially useful human adjuvants 
such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum. 

15 Alternatively, the immune response could be enhanced by 

combination and or coupling with molecules such as keyhole limpet 
hemocyanin, tetanus toxoid, diptheria toxoid, ovalbumin, cholera 
toxin or fragments thereof. Polyclonal antibodies are 
heterogeneous populations of antibody molecules derived from the 

20 sera of the immunized animals. 

Monoclonal antibodies, which are homogeneous populations of 
antibodies to a particular antigen, can be obtained by any 
technique which provides for the production of antibody molecules 
by continuous cell lines in culture. These include, but are not 

25 limited to, the hybridoma technique of Kohler and Milstein, (1975, 
Nature 256:495-497; and U.S. Patent No. 4,376,110), the human B- 
cell hybridoma technique (Kosbor et al., 1983, Immunology Today 
4:72; Cole et. aL, 1983, Proc. Natl. Acad. Sci . USA 80: 2026-2030) , 
and the EBV-hybridoma technique (Cole et al., 1985, Monoclonal 

30 Antibodies And Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Such antibodies may be of any immunoglobulin class including IgG, 
IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma 
producing the mAb of this invention may be cultivated in vitro or 
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in vivo. Production of high titers of mAbs in vivo makes this the 
presently preferred method of production. 

In addition, techniques developed for the production of 
"chimeric antibodies" (Morrison et al., 1984, Proc. Natl. Acad. 
5 Sci., 82:6851-6855; Neuberger et al., 1984, Nature, 312:604-608; 
Takeda et al., 1985, Nature, 314:452-454) by splicing the genes 
from a mouse antibody molecule of appropriate antigen specificity 
together with genes from a human antibody molecule of appropriate 
biological activity can be used. A chimeric antibody is a 

10 molecule in which different portions are derived from different 
animal species, such as those having a variable region derived 
from a murine mAb and a human immunoglobulin constant region. Such 
technologies are described in U.S. Patents Nos . 6,075,181 and 
5,877,397 and their respective disclosures which are herein 

15 incorporated by reference in their entirety. Also favored is the 
production of fully humanized monoclonal antibodies as described 
in US Patent No. 6,150,584 and respective disclosures which are 
herein incorporated by reference in their entirety. 

Alternatively, techniques described for the production of 

20 single chain antibodies (U.S. Patent 4,946,778; Bird, 1988, 

Science 242:423-426; Huston et al., 1988, Proc. Natl. Acad. Sci. 
USA 85:5879-5883; and Ward et al., 1989, Nature 334:544-546) can 
be adapted to produce single chain antibodies against NHP gene 
products. Single chain antibodies are formed by linking the heavy 

25 and light chain fragments of the Fv region via an amino acid 
bridge, resulting in a single chain polypeptide. 

Antibody fragments which recognize specific epitopes may be 
generated by known techniques. For example, such fragments 
include, but are not limited to: the F(ab') 2 fragments which can 

30 be produced by pepsin digestion of the antibody molecule and the 
Fab fragments which can be generated by reducing the disulfide 
bridges of the F(ab') 2 fragments. Alternatively, Fab expression 
libraries may be constructed (Huse et al., 1989, Science, 
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246:1275-1281) to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. 

Antibodies to a NHP can, in turn, be utilized to generate 
anti-idiotype antibodies that "mimic" a given NHP, using 
5 techniques well known to those skilled in the art. (See, e.g., 

Greenspan & Bona, 1993, FASEB J 7 (5) : 4 37-4 4 4 ; and Nissinoff, 1991, 
J. Immunol. 1 4 7 (8) : 242 9-24 38 ) . For example antibodies which bind 
to a NHP domain and competitively inhibit the binding of NHP to 
its cogna-te receptor can be used to generate anti-idiotypes that 

10 "mimic" the NHP and, therefore, bind and activate or neutralize a 
receptor. Such ant i-idiotypic antibodies or Fab fragments of such 
anti-idiotypes can be used in therapeutic regimens involving a NHP 
signaling pathway . 

The present invention is not to be limited in scope by the 

15 specific embodiments described herein, which are intended as 

single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the 
scope of the invention. Indeed, various modifications of the 
invention, in addition to those shown and described herein will 

20 become apparent to those skilled in the art from the foregoing 
description. Such modifications are intended to fall within the 
scope of the appended claims. All cited publications, patents, 
and patent applications are herein incorporated by reference in 
their entirety. 
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WHAT IS CLAIMED IS : 

1. An isolated nucleic acid molecule comprising at 
least 24 contiguous bases of nucleotide sequence first disclosed 

5 in SEQ ID NO: 1. 

2. An isolated nucleic acid molecule comprising a 
nucleotide sequence that: 

(a) encodes the amino acid sequence shown in SEQ ID 
10 NO: 2; and 

(b) hybridizes under stringent conditions to the 
nucleotide sequence of SEQ ID NO: 1 or the 
complement thereof . 

15 3 . An isolated nucleic acid molecule encoding the 

amino acid sequence described in SEQ ID NO: 2. 

4. An isolated oligopeptide comprising at least about 
12 amino acids in a sequence first disclosed in SEQ ID NO: 2. 

20 
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ABSTRACT 

Novel human polynucleotide and polypeptide sequences are 
disclosed that can be used in therapeutic, diagnostic, and 
pharmacogenomic applications. 

5 
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SEQUENCE LISTING 



<110> LEXICON GENETICS INCORPORATED 

<120> Novel Human Enzymes and Polynucleotides 
Encoding the Same 



<130> LEX-0118-PCT 



<150> US 60/179, 000 
<151> 2000-01-28 



<160> 15 

<170> FastSEQ for Windows Version 4.0 



<210> 1 

<211> 831 

<212> DNA 

<213> Homo sapiens 



<400> 1 

atgacctctt 

gtcactcgcg 

ttgccggaat 

attcctggtg 

attggaggct 

gggcctgatg 

cctggaaaaa 

tttgatactc 

cttgcacaaa 

ctgaccactg 

gtgtatgtgg 

cacagcaccg 

atcgtgtatt 

tttagacaga 



tccgcttggc 
cttgtagctt 
gctttaattc 
aatccacaca 
ctatccctga 
gaactttact 
ttacatt tea 
ettactgeag 
tctacgcaca 
gaccagccca 
ccacagcctc 
tggtgaaccc 
cagacataga 
agegatcaga 



cctcatccag 
catcegggag 
tccatatgga 
gaagctttct 
agaggatget 
agcaaagtat 
agaatctaaa 
agtgggtctg 
gagaggctgc 
ttgggagtta 
tcctgcccgg 
ttggggggag 
cctgaagaag 
cctctatgct 



cttcagattt 
gcagcaacgc 
gcgaaatatt 
gaagtagcaa 
gggaaattat 
agaaagatcc 
acattgagtc 
ggcatctget 
cagctgttgg 
ettcagegaa 
gatgacaaag 
gttctageca 
ctggctgaaa 
gtggagatga 



cttccatcaa 


atcagataac 


60 


aaggagccaa 


aatagtttct 


120 


ttcctgaata 


tgcagagaaa 


180 


aggaatgeag 


catatatctc 


240 


ataacacctg 


tgctgtgttt 


300 


atctgtttga 


cattgatgtt 


360 


egggtgatag 


t ttctccaca 


420 


aegacatgeg 


gt ttgeagag 


480 


tatatccagg 


agcttttaat 


540 


geegggctgt 


tgataatcag 


600 


cctcctatgt 


tgcctgggga 


660 


aagctggcac 


agaagaagca 


720 


tacgccagca 


aatccccgtt 


780 


aaaageccta 


a 


831 



<210> 2 
<211> 276 
<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Thr Ser Phe Arg Leu Ala Leu 

1 5 
Lys Ser Asp Asn Val Thr Arg Ala 
20 

Thr Gin Gly Ala Lys He Val Ser 

35 40 
Tyr Gly Ala Lys Tyr Phe Pro Glu 

50 55 
Ser Thr Gin Lys Leu Ser Glu Val 
65 70 
He Gly Gly Ser He Pro Glu Glu 
85 



He Gin Leu Gin He Ser Ser He 

10 15 

Cys Ser Phe He Arg Glu Ala Ala 
25 30 

Leu Pro Glu Cys Phe Asn Ser Pro 
45 

Tyr Ala Glu Lys He Pro Gly Glu 
60 

Ala Lys Glu Cys Ser He Tyr Leu 

75 80 

Asp Ala Gly Lys Leu Tyr Asn Thr 

90 95 
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Cys Ala Val Phe Gly Pro Asp Gly Thr Leu Leu Ala Lys Tyr Arg Lys 

100 105 110 

lie His Leu Phe Asp lie Asp Val Pro Gly Lys lie Thr Phe Gin Glu 

115 120 125 

Ser Lys Thr Leu Ser Pro Gly Asp Ser Phe Ser Thr Phe Asp Thr Pro 

130 135 140 

Tyr Cys Arg Val Gly Leu Gly He Cys Tyr Asp Met Arg Phe Ala Glu 
145 150 155 160 

Leu Ala Gin He Tyr Ala Gin Arg Gly Cys Gin Leu Leu Val Tyr Pro 

165 170 175 

Gly Ala Phe Asn Leu Thr Thr Gly Pro Ala His Trp Glu Leu Leu Gin 

180 185 190 

Arg Ser Arg Ala Val Asp Asn Gin Val Tyr Val Ala Thr Ala Ser Pro 

195 200 205 

Ala Arg Asp Asp Lys Ala Ser Tyr Val Ala Trp Gly His Ser Thr Val 

210 215 220 

Val Asn Pro Trp Gly Glu Val Leu Ala Lys Ala Gly Thr Glu Glu Ala 
225 230 235 240 

He Val Tyr Ser Asp He Asp Leu Lys Lys Leu Ala Glu He Arg Gin 

245 250 255 

Gin He Pro Val Phe Arg Gin Lys Arg Ser Asp Leu Tyr Ala Val Glu 

260 265 270 

Met Lys Lys Pro 
275 



<210> 3 

<211> 480 

<212> DNA 

<213> Homo sapiens 



tgccacacca tgctgtaggg agttaacttt tcatttgtgc 60 

ttactgcaga gtgggtctgg gcatctgcta cgacatgcgg 120 

ctacgcacag agaggctgcc agctgttggt atatccagga 180 

accagcccat tgggagttac ttcagcgaag ccgggctgtt 240 

cacagcctct cctgcccggg atgacaaagc ctcctatgtt 300 

ggtgaaccct tggggggagg ttctagccaa agctggcaca 360 

agacatagac ctgaagaagc tggctgaaat acgccagcaa 420 

gcgatcagac ctctatgctg tggagatgaa aaagccctaa 480 

<210> 4 
<211> 159 
<212> PRT 

<213> Homo sapiens 



<400> 3 
atgtcatgga 
attttctgtt 
tttgcagagc 
gcttttaatc 
gataatcagg 
gcctggggac 
gaagaagcaa 
atccccgttt 



ggatttcccc 
tggaaacagc 
ttgcacaaat 
tgaccactgg 
tgtatgtggc 
acagcaccgt 
tcgtgtattc 
ttagacagaa 



<400> 4 

Met Ser Trp Arg He Ser Pro Ala 

1 5 
Phe His Leu Cys He Phe Cys Leu 
20 

Leu Gly He Cys Tyr Asp Met Arg 

35 40 
Ala Gin Arg Gly Cys Gin Leu Leu 

50 55 
Thr Thr Gly Pro Ala His Trp Glu 
65 70 



Thr Pro Cys Cys Arg Glu Leu Thr 

10 15 
Glu Thr Ala Tyr Cys Arg Val Gly 
25 30 
Phe Ala Glu Leu Ala Gin He Tyr 
45 

Val Tyr Pro Gly Ala Phe Asn Leu 
60 

Leu Leu Gin Arg Ser Arg Ala Val 
75 80 
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Asp Asn Gin Val Tyr Val Ala Thr Ala Ser Pro Ala Arg Asp Asp Lys 

85 90 95 

Ala Ser Tyr Val Ala Trp Gly His Ser Thr Val Val Asn Pro Trp Gly 

100 105 110 

Glu Val Leu Ala Lys Ala Gly Thr Glu Glu Ala lie Val Tyr Ser Asp 

115 120 125 

lie Asp Leu Lys Lys Leu Ala Glu lie Arg Gin Gin lie Pro Val Phe 

130 135 140 

Arg Gin Lys Arg Ser Asp Leu Tyr Ala Val Glu Met Lys Lys Pro 
145 150 155 



<210> 5 

<211> 366 

<212> DNA 

<213> Homo ^sapiens 



<400> 5 

atgcggtttg cagagcttgc acaaatctac gcacagagag gctgccagct gttggtatat 60 

ccaggagctt ttaatctgac cactggacca gcccattggg agttacttca gcgaagccgg 120 

gctgttgata atcaggtgta tgtggccaca gcctctcctg cccgggatga caaagcctcc 180 

tatgttgcct ggggacacag caccgtggtg aacccttggg gggaggttct agccaaagct 240 

ggcacagaag aagcaatcgt gtattcagac atagacctga agaagctggc tgaaatacgc 300 

cagcaaatcc ccgtttttag acagaagcga tcagacctct atgctgtgga gatgaaaaag 360 

ccctaa 366 



<210> 6 

<211> 121 

<212> PRT 

<213> Homo sapiens 



<400> 6 
Met Arg Phe 
1 

Leu Leu Val 

Trp Glu Leu 
35 

Ala Thr Ala 
50 

Gly His Ser 
65 

Gly Thr Glu 

Ala Glu lie 

Leu Tyr Ala 
115 



Ala Glu 
5 

Tyr Pro 
20 

Leu Gin 

Ser Pro 

Thr Val 

Glu Ala 
85 

Arg Gin 
100 

Val Glu 



Leu Ala Gin lie 
Gly Ala 
Arg Ser 



Ala Arg 

55 
Val Asn 
70 

He Val 
Gin He 
Met Lys 



Phe Asn 
25 

Arg Ala 
40 

Asp Asp 



Pro Trp 

Tyr Ser 

Pro Val 
105 
Lys Pro 
120 



Tyr Ala Gin 
10 

Leu Thr Thr 

Val Asp Asn 

Lys Ala Ser 
60 

Gly Glu Val 
75 

Asp He Asp 
90 

Phe Arg Gin 



Arg Gly 

Gly Pro 
30 

Gin Val 
45 

Tyr Val 

Leu Ala 

Leu Lys 

Lys Arg 
110 



Cys Gin 
15 

Ala His 

Tyr Val 

Ala Trp 

Lys Ala 

80 
Lys Leu 
95 

Ser Asp 



<210> 7 
<211> 507 
<212> DNA 

<213> Homo sapiens 



<400> 7 

atgtcatgga ggatttcccc tgccacacca tgctgtaggg agttaacttt tcatttgtgc 60 
attttctgtt tggaaacagc ttactgcaga gtgggtctgg gcatctgcta cgacatgcgg 120 
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tttgcagagc ttgcacaaat ctacgcacag agaggctgcc agctgttggt atatccagga 180 

gcttttaatc tgaccactgg accagcccat tgggagttac ttcagcgaag ccgggctgtt 240 

gataatcagg tgtatgtggc cacagcctct cctgcccggg atgacaaagc ctcctatgtt 300 

gcctggggac acagcaccgt ggtgaaccct tggggggagg ttctagccaa agctggcaca 360 

gaagaagcaa tcgtgtattc agacatagac ctgaagaagc tggctgaaat acgccagcaa 420 

atccccgttt ttagacagaa gcgaaatatt ttcctgaata tgcagagaaa attcctggtg 480 

aatccacaca gaagctttct gaagtag 507 

<210> 8 
<211> 168 
<212> PRT 

<213> Homo sapiens 



<400> 8 

Met Ser Trp Arg lie Ser Pro Ala Thr Pro Cys Cys Arg 

1 5 10 

Phe His Leu Cys lie Phe Cys Leu Glu Thr Ala Tyr Cys 

20 25 
Leu Gly He Cys Tyr Asp Met Arg Phe Ala Glu Leu Ala 

35 40 45 

Ala Gin Arg Gly Cys Gin Leu Leu Val Tyr Pro Gly Ala 

50 55 60 

Thr Thr Gly Pro Ala His Trp Glu Leu Leu Gin Arg Ser 
65 70 75 

Asp Asn Gin Val Tyr Val Ala Thr Ala Ser Pro Ala Arg 

85 90 
Ala Ser Tyr Val Ala Trp Gly His Ser Thr Val Val Asn 

100 105 
Glu Val Leu Ala Lys Ala Gly Thr Glu Glu Ala lie Val 
115 120 125 

He Asp Leu Lys Lys Leu Ala Glu He Arg Gin Gin He 

130 135 140 

Arg Gin Lys Arg Asn He Phe Leu Asn Met Gin Arg Lys 
145 150 155 

Asn Pro His Arg Ser Phe Leu Lys 
165 



Glu Leu Thr 
15 

Arg Val Gly 
30 

Gin He Tyr 

Phe Asn Leu 

Arg Ala Val 
80 

Asp Asp Lys 
95 

Pro Trp Gly 
110 

Tyr Ser Asp 

Pro Val Phe 

Phe Leu Val 
160 



<210> 9 
<211> 393 
<212> DNA 
<213> Homo 



sapiens 



<400> 9 

atgcggtttg 

ccaggagctt 

gctgttgata 

tatgttgcct 

ggcacagaag 

cagcaaatcc 

ctggtgaatc 



cagagcttgc 
ttaatctgac 
atcaggtgta 
ggggacacag 
aagcaatcgt 
ccgtttttag 
cacacagaag 



acaaatctac 
cactggacca 
tgtggccaca 
caccgtggtg 
gtattcagac 
acagaagcga 
ctttctgaag 



gcacagagag 
gcccattggg 
gcctctcctg 
aacccttggg 
atagacctga 
aatattttcc 
tag 



gctgccagct 
agttacttca 
cccgggatga 
gggaggttct 
agaagctggc 
tgaatatgca 



gttggtatat 
gcgaagccgg 
caaagcctcc 
agccaaagct 
tgaaatacgc 
gagaaaattc 



60 
120 
180 
240 
300 
360 
393 



<210> 10 
<211> 130 
<212> PRT 
<213> Homo 



sapiens 
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<400> 10 

Met Arg Phe Ala Glu Leu Ala Gin He Tyr Ala Gin Arg Gly Cys Gin 

15 10 15 

Leu Leu Val Tyr Pro Gly Ala Phe Asn Leu Thr Thr Gly Pro Ala His 

20 25 30 

Trp Glu Leu Leu Gin Arg Ser Arg Ala Val Asp Asn Gin Val Tyr Val 

35 40 45 

Ala Thr Ala Ser Pro Ala Arg Asp Asp Lys Ala Ser Tyr Val Ala Trp 

50 55 60 

Gly His Ser Thr Val Val Asn Pro Trp Gly Glu Val Leu Ala Lys Ala 
65 70 75 80 

Gly Thr Glu Glu Ala He Val Tyr Ser Asp He Asp Leu Lys Lys Leu 

85 90 95 

Ala Glu He Arg Gin Gin He Pro Val Phe Arg Gin Lys Arg Asn He 

- 100 105 110 

Phe Leu Asn Met Gin Arg Lys Phe Leu Val Asn Pro His Arg Ser Phe 
115 120 125 

Leu Lys 
130 

<210> 11 
<211> 459 
<212> DNA 

<213> Homo sapiens 
<400> 11 

atgacctctt tccgcttggc cctcatccag cttcagattt cttccatcaa atcagataac 60 
gtcactcgcg cttgtagctt catccgggag gcagcaacgc aaggagccaa aatagtttct 120 
ttgccggaat gctttaattc tccatatgga gcgaaatatt ttcctgaata tgcagagaaa 180 
attcctggtg aatccacaca gaagctttct gaagtagcaa aggaatgcag catatatctc 240 
attggaggct ctatccctga agaggatgct gggaaattat ataacacctg tgctgtgttt 300 
gggcctgatg gaactttact agcaaagtat agaaagatcc atctgtttga cattgatgtt 360 
cctggaaaaa ttacatttca agaatctaaa acattgagtc cgggtgatag tttctccaca 420 
tttgatactc gtatgtacca gataagtttg cctctttag 459 

<210> 12 
<211> 152 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Thr Ser Phe Arg Leu Ala Leu He Gin Leu Gin He Ser Ser He 

15 10 15 

Lys Ser Asp Asn Val Thr Arg Ala Cys Ser Phe He Arg Glu Ala Ala 

20 25 30 

Thr Gin Gly Ala Lys He Val Ser Leu Pro Glu Cys Phe Asn Ser Pro 

35 40 45 

Tyr Gly Ala Lys Tyr Phe Pro Glu Tyr Ala Glu Lys lie Pro Gly Glu 

50 55 60 

Ser Thr Gin Lys Leu Ser Glu Val Ala Lys Glu Cys Ser He Tyr Leu 
65 70 75 80 

He Gly Gly Ser He Pro Glu Glu Asp Ala Gly Lys Leu Tyr Asn Thr 

85 90 95 

Cys Ala Val Phe Gly Pro Asp Gly Thr Leu Leu Ala Lys Tyr Arg Lys 

100 105 110 

He His Leu Phe Asp He Asp Val Pro Gly Lys He Thr Phe Gin Glu 
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115 120 125 

Ser Lys Thr Leu Ser Pro Gly Asp Ser Phe Ser Thr Phe Asp Thr Arg 

130 135 140 

Met Tyr Gin lie Ser Leu Pro Leu 
145 150 



<210> 13 

<211> 858 

<212> DNA 

<213> Homo sapiens 



<400> 13 

atgacctctt tccgcttggc cctcatccag cttcagattt cttccatcaa atcagataac 60 

gtcactcgcg cttgtagctt catccgggag gcagcaacgc aaggagccaa aatagtttct 120 

ttgccggaat *gctttaattc tccatatgga gcgaaatatt ttcctgaata tgcagagaaa 180 

attcctggtg aatccacaca gaagctttct gaagtagcaa aggaatgcag catatatctc 240 

attggaggct ctatccctga agaggatgct gggaaattat ataacacctg tgctgtgttt 300 

gggcctgatg gaactttact agcaaagtat agaaagatcc atctgtttga cattgatgtt 360 

cctggaaaaa ttacatttca agaatctaaa acattgagtc cgggtgatag tttctccaca 420 

tttgatactc cttactgcag agtgggtctg ggcatctgct acgacatgcg gtttgcagag 480 

cttgcacaaa tctacgcaca gagaggctgc cagctgttgg tatatccagg agcttttaat 540 

ctgaccactg gaccagccca ttgggagtta cttcagcgaa gccgggctgt tgataatcag 600 

gtgtatgtgg ccacagcctc tcctgcccgg gatgacaaag cctcctatgt tgcctgggga 660 

cacagcaccg tggtgaaccc ttggggggag gttctagcca aagctggcac agaagaagca 720 

atcgtgtatt cagacataga cctgaagaag ctggctgaaa tacgccagca aatccccgtt 780 

tttagacaga agcgaaatat tttcctgaat atgcagagaa aattcctggt gaatccacac 840 

agaagctttc tgaagtag 858 



<210> 14 

<211> 285 

<212> PRT 

<213> Homo sapiens 



<400> 14 
Met Thr Ser Phe 
1 

Lys Ser Asp Asn 
20 

Thr Gin Gly Ala 
35 

Tyr Gly Ala Lys 
50 

Ser Thr Gin Lys 
65 

He Gly Gly Ser 

Cys Ala Val Phe 
100 

He His Leu Phe 
115 

Ser Lys Thr Leu 
130 

Tyr Cys Arg Val 
145 

Leu Ala Gin He 



Arg Leu Ala Leu 
5 

Val Thr Arg Ala 

Lys He Val Ser 
40 

Tyr Phe Pro Glu 
55 

Leu Ser Glu Val 
70 

He Pro Glu Glu 
85 

Gly Pro Asp Gly 

Asp He Asp Val 
120 

Ser Pro Gly Asp 
135 

Gly Leu Gly He 
150 

Tyr Ala Gin Arg 
165 



He Gin Leu Gin 
10 

Cys Ser Phe He 
25 

Leu Pro Glu Cys 

Tyr Ala Glu Lys 
60 

Ala Lys Glu Cys 
75 

Asp Ala Gly Lys 
90 

Thr Leu Leu Ala 
105 

Pro Gly Lys He 

Ser Phe Ser Thr 
140 

Cys Tyr Asp Met 
155 

Gly Cys Gin Leu 
170 



lie Ser Ser He 
15 

Arg Glu Ala Ala 
30 

Phe Asn Ser Pro 
45 

He Pro Gly Glu 

Ser He Tyr Leu 
80 

Leu Tyr Asn Thr 
95 

Lys Tyr Arg Lys 
110 

Thr Phe Gin Glu 
125 

Phe Asp Thr Pro 

Arg Phe Ala Glu 
160 

Leu Val Tyr Pro 
175 
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Gly Ala Phe Asn Leu Thr Thr Gly Pro Ala His Trp Glu Leu Leu Gin 

180 185 190 

Arg Ser Arg Ala Val Asp Asn Gin Val Tyr Val Ala Thr Ala Ser Pro 

195 200 205 

Ala Arg Asp Asp Lys Ala Ser Tyr Val Ala Trp Gly His Ser Thr Val 

210 215 220 

Val Asn Pro Trp Gly Glu Val Leu Ala Lys Ala Gly Thr Glu Glu Ala 

225 230 235 240 
lie Val Tyr Ser Asp He Asp Leu Lys Lys Leu Ala Glu He Arg Gin 

245 250 255 
Gin He Pro Val Phe Arg Gin Lys Arg Asn He Phe Leu Asn Met Gin 

260 265 270 

Arg Lys Phe Leu Val Asn Pro His Arg Ser Phe Leu Lys 
275 280 285 

<210> 15 
<211> 3093 
<212> DNA 

<213> Homo sapiens 
<400> 15 

ggatggtggg gcatacctgt ggtcccagct acataagagg ctgagacaag aggattgcct 60 

gaactgagta ggtcaaggct gcagtggacc atgtttgtgc cactgcactc cagcctgggc 120 

gacagaacaa ggccctgcct caaaataaaa aatattagct aatggaaagt gattatcata 180 

aaagctaaaa gggaacttta aagaacagaa gaaaagcaaa tatgatgtat agctactacc 240 

tccaggaaga aataagcttg gaagagcccc caacctcctt gctccagggc tgagcacaga 300 

ccttgtcagg gctggctaca taatttgtgg ggcccagttc ccttgttcag atagcaagag 360 

aaaagtgctg ttagcttttc cttctgcagt atctctttca acctctcatg gtgttatttg 420 

ctgtttaatg tcatgttctc ttggacacat gaatacttat ggggtaagtg cagactttta 480 

gaggtgcctg ggacccctgt cctgtgaata ggcatgtgtg cagctcactg gctgccaggt 540 

tttccctctg ccagcagcgg gatcgatgtg ctgtgaccca gccagtagtg gggaaactga 600 

gacagacatc ttcccttccc atgagctggg cctgctcatg ggaattatgt gagcagcttc 660 

caaggaatca cactttctgt gctgggacat actcaagtat atggattgga ggtagacgag 720 

aggcccattg aacaaacagt aagggacagg accatattca aacccagtct ttttacttta 780 

agccatattc ctcatttcat tcccctacac tgcgtagtaa gaagctggtt cactctagat 840 

tcttgtgcct ggcatgggac tttgcccatg gatattgctc tatctccaga tagattttag 900 

actattgaca ttttggacag gataattctt cgttgtgtta tggagggggt tgtcctatgc 960 

attgtaggat gtttggcagt atccttggtc tctattcatt agatgccact catacctcat 1020 

cagttgtggc atcaaaggta tcttcagaca ttgtcagatg tccccccggg gacataactg 1080 

ccttccattt gagaactatg gctctgtctg aatccagcag ttcgatcttc tgatagctgt 1140 

tttcttttgt ctttgttctc agcccccccc cccccggtag gacccgcggt ccgccggatc 1200 

tccagcgctc agtccgcgcc gcaggtggtg cttgtctgca gagtcatgac ctctttccgc 1260 

ttggccctca tccagcttca. gatttcttcc atcaaatcag ataacgtcac tcgcgcttgt 1320 

agcttcatcc gggaggcagc aacgcaagga gccaaaatag tttctttgcc ggaatgcttt 1380 

aattctccat atggagcgaa atattttcct gaatatgcag agaaaattcc tggtgaatcc 1440 

acacagaagc tttctgaagt agcaaaggaa tgcagcatat atctcattgg aggctctatc 1500 

cctgaagagg atgctgggaa attatataac acctgtgctg tgtttgggcc tgatggaact 1560 

ttactagcaa agtatagaaa gatccatctg tttgacattg atgttcctgg aaaaattaca 1620 

tttcaagaat ctaaaacatt gagtccgggt gatagtttct ccacatttga tactcgtatg 1680 

taccagataa gtttgcctct ttagcaatct cagtagaaga caatcaggta tttatttctt 1740 

ttttgtctct ctccgatttc ttcacataac ctaactgaaa gaccataagt gagaaaggca 1800 

gagaatcatc acagatctgg aaagttcggg cttatttgag aactaaggat ttgacacgat 1860 

tttgcccttt gatttgattg tagcttcctg ttacggcttc cagagtatac ctattaggct 1920 

acagttgagt acctcccatc tagataataa gcattcaatt agaatgaatt tctcatcttt 1980 

actccgctga tgtaaatgat gtctttatga gatgaagtcc aagtaggaat gagcttgtaa 2040 

attatctctg tcctcaggtc ctgtgttaat ttatccctgt cagtgttttg tgatcattat 2100 
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gtcatggagg atttcccctg ccacaccatg ctgtagggag ttaacttttc atttgtgcat 2160 

tttctgtttg gaaacagctt actgcagagt gggtctgggc atctgctacg acatgcggtt 2220 

tgcagagctt gcacaaatct acgcacagag aggctgccag ctgttggtat atccaggagc 2280 

ttttaatctg accactggac cagcccattg ggagttactt cagcgaagcc gggctgttga 2340 

taatcaggtg tatgtggcca cagcctctcc tgcccgggat gacaaagcct cctatgttgc 2400 

ctggggacac agcaccgtgg tgaacccttg gggggaggtt ctagccaaag ctggcacaga 2460 

agaagcaatc gtgtattcag acatagacct gaagaagctg gctgaaatac gccagcaaat 2520 

ccccgttttt agacagaagc gatcagacct ctatgctgtg gagatgaaaa agccctaaag 2580 

tttatgtttc taatgtgtca cagaatagga cgatatgatt ctacaacata atcaactccc 2640 

tattaaattc tttaatgaag aaaaaaaaaa aaaaaaaaaa aaaaaatatt ttcctgaata 2700 

tgcagagaaa attcctggtg aatccacaca gaagctttct gaagtagcaa aggaatgcag 2760 

catatatctc attggaggct ctatccctga agaggatgct gggaaattat ataacacctg 2820 

tgctgtgttt gggcctgatg gaactttact agcaaagtat agaaagatcc atctgtttga 2880 

cattgatgtt cctggaaaaa ttacatttca agaatctaaa acattgagtc cgggtgatag 2940 

tttctccaca -tttgatactc cttactgcag agtgggtctg ggcatctgct acgacatgcg 3000 

gtttgcagag cttgcacaaa tctacgcaca gagaggctgc cagctgttgg tatatccagg 3060 

agcttttaat ctgaccactg gaccagccca ttg 3093 
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NOVEL HUMAN ENZYMES AND 
POLYNUCLEOTIDES ENCODING THE SAME 

The present application claims the benefit of U.S. Provisional 
5 Application Number 60/179,000 which was filed on January 28, 2000 
and is herein incorporated by reference in its entirety. 



1 . INTRODUCTION 
The present invention relates to the discovery, 

10 identification, and characterization of novel human 

polynucleotides encoding proteins sharing sequence similarity with 
mammalian enzymes. The invention encompasses the described 
polynucleotides, host cell expression systems, the encoded 
protein, fusion proteins, polypeptides and peptides, antibodies to 

15 the encoded proteins and peptides, and sequencetically engineered 
animals that either lack or over express the disclosed sequences, 
antagonists and agonists of the proteins, and other compounds that 
modulate the expression or activity of the proteins encoded by the 
disclosed polynucleotides that can be used for diagnosis, drug 

20 screening, clinical trial monitoring and the treatment of 
physiological disorders . 

2 . BACKGROUND OF THE INVENTION 
Enzymes are biological catalysts that modify biological 
25 substrates, including proteins, as part of degradation, 

maturation, catabolic, metabolic, differentiation, and secretory 
pathways within the body. Enzyme abnormalities have thus been 
associated with, inter alia, growth, development, protein and 
cellular senescence, cancer, or other diseases. 

30 

3 . SUMMARY OF THE INVENTION 
The present invention relates to the discovery, 
identification, and characterization of nucleotides that encode 
novel human proteins, and the corresponding amino acid sequences 
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of these proteins. The novel human proteins (NHPs) described for 
the first time herein share structural similarity with nitrilase 
proteins from a wide variety of living organisms. 

The novel human nucleic acid (cDNA) sequences described 
5 herein, encode proteins/open reading frames (ORFs) of 276, 159, 
121, 168, 130, 152, and 285 amino acids in length (see 
respectively SEQ ID NOS : 2, 4, 6, 8, 10, 12 and 14) . 

The invention also encompasses agonists and antagonists of 
the described NHPs including small molecules, large molecules, 

10 mutant NHPs, or portions thereof that compete with native NHPs, 

NHP peptides, and antibodies, as well as nucleotide sequences that 
can be used to inhibit the expression of the described NHPs (e.g., 
antisense and ribozyme molecules, and gene or regulatory sequence 
replacement constructs) or to enhance the expression of the 

15 described NHPs (e.g., expression constructs that place the 

described sequence under the control of a strong promoter system) , 
and transgenic animals that express a NHP transgene, or "knock- 
outs" (which can be conditional) that do not express a functional 
NHP. 

20 Further, the present invention also relates to processes for 

identifying compounds that modulate, i.e., act as agonists or 
antagonists, of NHP expression and/or NHP activity that utilize 
purified preparations of the described NHP and/or NHP product, or 
cells expressing the same. Such compounds can be used as 

25 therapeutic agents for the treatment of any of a wide variety of 
symptoms associated with biological disorders or imbalances. 



4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
The Sequence Listing provides the sequences of the NHP ORFs 
3 0 encoding the described NHP amino acid sequences. SEQ ID NOS: 15 
describe representative a nitrilase-like ORF with flanking 
sequences . 
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5. DETAILED DESCRIPTION OF THE INVENTION 
The NHPs, described for the first time herein, are novel 
proteins that are expressed in, inter alia, human cell lines, gene 
trapped cells and human brain, fetal brain, pituitary, cerebellum, 
5 spinal cord, thymus, spleen, lymph node, bone marrow, trachea, 
lung, kidney, fetal liver, liver, prostate, testis, thyroid, 
adrenal gland, pancreas, salivary gland, stomach, small intestine, 
colon, skeletal muscle, heart, placenta, mammary gland, adipose, 
skin, esophagus, bladder, pericardium, hypothalamus, ovary, fetal 

10 kidney, and fetal lung cells. 

The described sequences were compiled from gene trapped cDNAs 
and clones isolated from human prostate, lymph node, pituitary, 
mammary gland, and kidney cDNA library (Edge Biosystems, 
Gaithersburg, MD) . The present invention encompasses the 

15 nucleotides presented in the Sequence Listing, host cells 

expressing such nucleotides, the expression products of such 
nucleotides, and: (a) nucleotides that encode mammalian homologs 
of the described sequences, including the specifically described 
NHPs, and the NHP products; (b) nucleotides that encode one or 

2 0 more portions of a NHP that correspond to functional domains of 

the NHP, and the polypeptide products specified by such nucleotide 
sequences, including but not limited to the novel regions of any 
active domain(s); (c) isolated nucleotides that encode mutant 
versions, engineered or naturally occurring, of a described NHP in 

25 which all or a part of at least one domain is deleted or altered, 
and the polypeptide products specified by such nucleotide 
sequences, including but not limited to soluble proteins and 
peptides in which all or a portion of the signal sequence is 
deleted; (d) nucleotides that encode chimeric fusion proteins 

30 containing all or a portion of a coding region of a NHP, or one of 
its domains (e.g., a receptor or ligand binding domain, accessory 
protein/self -association domain, etc.) fused to another peptide or 
polypeptide; or (e) therapeutic or diagnostic derivatives of the 
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described polynucleotides such as oligonucleotides, antisense 
polynucleotides, ribozymes, dsRNA, or gene therapy constructs 
comprising a sequence first disclosed in the Sequence Listing. 

As discussed above, the present invention includes: (a) the 
5 human DNA sequences presented in the Sequence Listing (and vectors 
comprising the same) and additionally contemplates any nucleotide 
sequence encoding a contiguous NHP open reading frame (ORF) , or a 
contiguous exon splice junction first described in the Sequence 
Listing, that hybridizes to a complement of a DNA sequence 

10 presented in the Sequence Listing under highly stringent 

conditions, e.g., hybridization to filter-bound DNA in 0.5 M 
NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and 
washing in 0 . IxSSC/O . 1% SDS at 68°C (Ausubel F.M. et al . , eds . , 
1989, Current Protocols in Molecular Biology, Vol. I, Green 

15 Publishing Associates, Inc., and John Wiley & sons, Inc., New 
York, at p. 2.10.3) and encodes a functionally equivalent gene 
product. Additionally contemplated are any nucleotide sequences 
that hybridize to the complement of the DNA sequence that encode 
and express an amino acid sequence presented in the Sequence 

2 0 Listing under moderately stringent conditions, e.g., washing in 
0.2xSSC/0.1% SDS at 42°C (Ausubel et al . , 1989, supra), yet still 
encode a functionally equivalent NHP product. Functional 
equivalents of a NHP include naturally occurring NHPs present in 
other species and mutant NHPs whether naturally occurring or 

25 engineered (by site directed mutagenesis, gene shuffling, directed 
evolution as described in, for example, U.S. Patent No. 
5,837,458). The invention also includes degenerate nucleic acid 
variants of the disclosed NHP polynucleotide sequences. 

Additionally contemplated are polynucleotides encoding a NHP 

30 ORF, or its functional equivalent, encoded by a polynucleotide 

sequence that is about 99, 95, 90, or about 85 percent similar or 
identical to corresponding regions of the nucleotide sequences of 
the Sequence Listing (as measured by BLAST sequence comparison 
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analysis using, for example, the GCG sequence analysis package 
using standard default settings) . 

The invention also includes nucleic acid molecules, 
preferably DNA molecules, that hybridize to, and are therefore the 
5 complements of, the described NHP nucleotide sequences. Such 
hybridization conditions may be highly stringent or less highly 
stringent, as described above. In instances where the nucleic 
acid molecules are deoxyoligonucleotides ("DNA oligos"), such 
molecules are generally about 16 to about 100 bases long, or about 

10 20 to about 80, or about 34 to about 45 bases long, or any 
variation or combination of sizes represented therein that 
incorporate a contiguous region of sequence first disclosed in the 
Sequence Listing. Such oligonucleotides can be used in 
conjunction with the polymerase chain reaction (PCR) to screen 

15 libraries, isolate clones, and prepare cloning and sequencing 
templates, etc. 

Alternatively, such NHP oligonucleotides can be used as 
hybridization probes for screening libraries, and assessing gene 
expression patterns (particularly using a micro array or high- 

20 throughput "chip" format) . Additionally, a series of the 

described NHP oligonucleotide sequences, or the complements 
thereof, can be used to represent all or a portion of the 
described NHP sequences. An oligonucleotide or polynucleotide 
sequence first disclosed in at least a portion of one or more of 

25 the sequences of SEQ ID NOS : 1-15 can be used as a hybridization 
probe in conjunction with a solid support matrix/substrate 
(resins, beads, membranes, plastics, polymers, metal or metallized 
substrates, crystalline or polycrystalline substrates, etc.). Of 
particular note are spatially addressable arrays (i.e., gene 

30 chips, microtiter plates, etc.) of oligonucleotides and 

polynucleotides, or corresponding oligopeptides and polypeptides, 
wherein at least one of the biopolymers present on the spatially 
addressable array comprises an oligonucleotide or polynucleotide 
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sequence first disclosed in at least one of the sequences of SEQ 
ID NOS: 1-15, or an amino acid sequence encoded thereby. Methods 
for attaching biopolymers to, or synthesizing biopolymers on, 
solid support matrices, and conducting binding studies thereon are 
5 disclosed in, .inter alia, U.S. Patent Nos . 5,700,637, 5,556,752, 
5,744,3 05, 4, 631,211, 5,445,934, 5,2 52,743, 4,713,326, 5,424,186, 
and 4,689,405 the disclosures of which are herein incorporated by 
reference in their entirety. 

Addressable arrays comprising sequences first disclosed in 

10 SEQ ID NOS: 1-15 can be used to identify and characterize the 
temporal and tissue specific expression of a gene. These 
addressable arrays incorporate oligonucleotide sequences of 
sufficient length to confer the required specificity, yet be 
within the limitations of the production technology. The length 

15 of these probes is within a range of between about 8 to about 2000 
nucleotides. Preferably the probes consist of 60 nucleotides and 
more preferably 25 nucleotides from the sequences first disclosed 
in SEQ ID NOS: 1-15. 

For example, a series of the described oligonucleotide 

2 0 sequences, or the complements thereof, can be used in chip format 
to represent all or a portion of the described sequences. The 
oligonucleotides, typically between about 16 to about 40 (or any 
whole number within the stated range) nucleotides in length can 
partially overlap each other and/or the sequence may be 

25 represented using oligonucleotides that do not overlap. 

Accordingly, the described polynucleotide sequences shall 
typically comprise at least about two or three distinct 
oligonucleotide sequences of at least about 8 nucleotides in 
length that are each first disclosed in the described Sequence 

30 Listing. Such oligonucleotide sequences can begin at any 

nucleotide present within a sequence in the Sequence Listing and 
proceed in either a sense (5'-to-3') orientation vis-a-vis the 
described sequence or in an antisense orientation. 
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Microarray-based analysis allows the discovery of broad 
patterns of genetic activity, providing new understanding of gene 
functions and generating novel and unexpected insight into 
transcriptional processes and biological mechanisms. The use of 
addressable arrays comprising sequences first disclosed in SEQ ID 
N0S:1-15 provides detailed information about transcriptional 
changes involved in a specific pathway, potentially leading to the 
identification of novel components or gene functions that manifest 
themselves as novel phenotypes . 

Probes consisting of sequences first disclosed in SEQ ID 
N0S:1-15 can also be used in the identification, selection and 
validation of novel molecular targets for drug discovery. The use 
of these unique sequences permits the direct confirmation of drug 
targets and recognition of drug dependent changes in gene 
expression that are modulated through pathways distinct from the 
drugs intended target. These unique sequences therefore also have 
utility in defining and monitoring both drug action and toxicity. 

As an example of utility, the sequences first disclosed in 
SEQ ID N0S:1-15 can be utilized in microarrays or other assay 
formats, to screen collections of genetic material from patients 
who have a particular medical condition. These investigations can 
also be carried out using the sequences first disclosed in SEQ ID 
N0S:1-15 in silico and by comparing previously collected genetic 
databases and the disclosed sequences using computer software 
known to those in the art. 

Thus the sequences first disclosed in SEQ ID N0S:1-15 can be 
used to identify mutations associated with a particular disease 
and also as a diagnostic or prognostic assay. 

Although the presently described sequences have been 
specifically described using nucleotide sequence, it should be 
appreciated that each of the sequences can uniquely be described 
using any of a wide variety of additional structural attributes, 
or combinations thereof. For example, a given sequence can be 
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described by the net composition of the nucleotides present within 
a given region of the sequence in conjunction with the presence of 
one or more specific oligonucleotide sequence (s) first disclosed 
in the SEQ ID NOS : 1-15. Alternatively, a restriction map 
5 specifying the relative positions of restriction endonuclease 
digestion sites, or various palindromic or other specific 
oligonucleotide sequences can be used to structurally describe a 
given sequence. Such restriction maps, which are typically 
generated by widely available computer programs {e.g., the 

10 University of Wisconsin GCG sequence analysis package, SEQUENCHER 
3.0, Gene Codes Corp., Ann Arbor, MI, etc.), can optionally be 
used in conjunction with one or more discrete nucleotide 
sequence (s) present in the sequence that can be described by the 
relative position of the sequence relatve to one or more 

15 additional sequence (s) or one or more restriction sites present in 
the disclosed sequence. 

For oligonucleotide probes, highly stringent conditions may 
refer, for example, to washing in 6xSSC/0.05% sodium pyrophosphate 
at 37°C (for 14-base oligos), 48°C (for 17-base oligos) , 55°C (for 

20 20-base oligos), and 60°C (for 23-base oligos). These nucleic 

acid molecules may encode or act as NHP gene antisense molecules, 
useful, for example, in NHP gene regulation (for and/or as 
antisense primers in amplification reactions of NHP nucleic acid 
sequences) . With respect to NHP gene regulation, such techniques 

25 can be used to regulate biological functions. Further, such 
sequences may be used as part of ribozyme and/or triple helix 
sequences that are also useful for NHP gene regulation. 

Inhibitory antisense or double stranded oligonucleotides can 
additionally comprise at least one modified base moiety which is 

30 selected from the group including but not limited to 

5-f luorouracil , 5-bromouracil , 5-chlorouracil , 5-iodouracil , 
hypoxanthine, xantine, 4-acetylcytosine , 5- ( carboxyhydroxylmethyl ) 
uracil , 5-carboxymethylaminomethyl-2-thiouridine , 
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5-carboxymethylaminomethyluracil , dihydrouracil , beta-D- 
galactosylqueosine , inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine , 2 , 2 -dimethyl guanine, 

2 - methyladenine , 2 -methy Iguanine , 3 -methylcy tosine , 
5 5-methylcytosine , N6-adenine, 7 -me thy Iguanine, 

5 -me thy laminomethy lurac i 1 , 5 -me thoxyaminomethy 1 -2 - thiouraci 1 , 
beta-D-mannosylqueosine , 5'-methoxycarboxymethyluracil , 
5-methoxyuracil , 2 -methyl thio-N6-isopentenyladenine , uracil- 5- 
oxyacetic acid (v) , wybutoxosine , pseudouracil , queosine, 

10 2-thiocytosine, 5-methyl-2-thiouracil , 2-thiouracil , 4-thiouracil , 
5-methyluracil , uracil-5-oxyacetic acid methylester, uracil- 
5-oxyacetic acid (v) , 5 -methyl -2- thiouracil , 3- (3-amino-3-N-2- 
carboxypropyl ) uracil, (acp3)w, and 2 , 6-diaminopurine . 

The antisense oligonucleotide can also comprise at least one 

15 modified sugar moiety selected from the group including but not 
limited to arabinose, 2 - f luoroarabinose , xylulose, and hexose. 

In yet another embodiment, the antisense oligonucleotide will 
comprise at least one modified phosphate backbone selected from 
the group consisting of a phosphorothioate , a phosphorodithioate, 

2 0 a phosphoramidothioate , a phosphoramidate, a phosphordiamidate , a 
methylphosphonate, an alkyl phosphotriester , and a formacetal or 
analog thereof. 

In yet another embodiment, the antisense oligonucleotide is 
an a-anomeric oligonucleotide. An a-anomeric oligonucleotide 

2 5 forms specific double- stranded hybrids with complementary RNA in 
which, contrary to the usual (3 -units, the strands run parallel to 
each other (Gautier et al . , 1987, Nucl. Acids Res. 15:6625-6641). 
The oligonucleotide is a 2'-0-methylribonucleotide (Inoue et al . , 
1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA-DNA 

30 analogue (Inoue et al . , 1987, FEBS Lett. 215:327-330). 

Alternatively, double stranded RNA can be used to disrupt the 
expression and function of a targeted NHP . 
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Oligonucleotides of the invention can be synthesized by- 
standard methods known in the art, e.g. by use of an automated DNA 
synthesizer (such as are commercially available from Biosearch, 
Applied Biosystems, etc.). As examples, phosphorothioate 
5 oligonucleotides can be synthesized by the method of Stein et al . 
(1988, Nucl. Acids Res. 25:3209), and methylphosphonate 
oligonucleotides can be prepared by use of controlled pore glass 
polymer supports (Sarin et al . , 1988, Proc . Natl. Acad. Sci . 
U.S.A. 85:744 8-7 451) , etc. 

10 Low stringency conditions are well known to those of skill in 

the art, and will vary predictably depending on the specific 
organisms from which the library and the labeled sequences are 
derived. For guidance regarding such conditions see, for example, 
Sambrook et al . , 1989, Molecular Cloning, A Laboratory Manual (and 

15 periodic updates thereof) , Cold Springs Harbor Press, N.Y. ; and 
Ausubel et al . , 1989, Current Protocols in Molecular Biology, 
Green Publishing Associates and Wiley Interscience, N.Y. 

Alternatively, suitably labeled NHP nucleotide probes can be 
used to screen a human genomic library using appropriately 

20 stringent conditions or by PCR. The identification and 
characterization of human genomic clones is helpful for 
identifying polymorphisms (including, but not limited to, 
nucleotide repeats, microsatellite alleles, single nucleotide 
polymorphisms, or coding single nucleotide polymorphisms), 

2 5 determining the genomic structure of a given locus/allele , and 

designing diagnostic tests. For example, sequences derived from 
regions adjacent to the intron/exon boundaries of the human gene 
can be used to design primers for use in amplification assays to 
detect mutations within the exons , introns , splice sites {e.g., 

30 splice acceptor and/or donor sites), etc., that can be used in 
diagnostics and pharmacogenomics . 

Further, a NHP homolog can be isolated from nucleic acid from 
an organism of interest by performing PCR using two degenerate or 
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"wobble" oligonucleotide primer pools designed on the basis of 
amino acid sequences within the NHP products disclosed herein. 
The template for the reaction may be total RNA, mRNA, and/or cDNA 
obtained by reverse transcription of mRNA prepared from human or 
5 non-human cell lines or tissue known or suspected to express an 
allele of a NHP gene. 

The PCR product can be subcloned and sequenced to ensure that 
the amplified sequences represent the sequence of the desired NHP 
sequence. The PCR fragment can then be used to isolate a full 

10 length cDNA clone by a variety of methods. For example, the 
amplified fragment can be labeled and used to screen a cDNA 
library, such as a bacteriophage cDNA library. Alternatively, the 
labeled fragment can be used to isolate genomic clones via the 
screening of a genomic library. 

15 PCR technology can also be used to isolate full length cDNA 

sequences. For example, RNA can be isolated, following standard 
procedures, from an appropriate cellular or tissue source (i.e., 
one known, or suspected, to express a NHP sequence, such as, for 
example, testis tissue) . A reverse transcription (RT) reaction 

20 can be performed on the RNA using an oligonucleotide primer 

specific for the most 5 ' end of the amplified fragment for the 
priming of first strand synthesis. The resulting RNA/DNA hybrid 
may then be "tailed" using a standard terminal transferase 
reaction, the hybrid may be digested with RNase H, and second 

25 strand synthesis may then be primed with a complementary primer. 
Thus, cDNA sequences upstream of the amplified fragment can be 
isolated. For a review of cloning strategies that can be used, 
see e.g., Sambrook et al . , 1989, supra. 

A cDNA encoding a mutant NHP sequence can be isolated, for 

3 0 example, by using PCR. In this case, the first cDNA strand may be 
synthesized by hybridizing an oligo-dT oligonucleotide to mRNA 
isolated from tissue known or suspected to be expressed in" an 
individual putatively carrying a mutant NHP allele, and by 
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extending the new strand with reverse transcriptase. The second 
strand of the cDNA is then synthesized using an oligonucleotide 
that hybridizes specifically to the 5' end of the normal gene. 
Using these two primers, the product is then amplified via PCR, 
5 optionally cloned into a suitable vector, and subjected to DNA 

sequence analysis through methods well known to those of skill in 
the art. By comparing the DNA sequence of the mutant NHP allele 
to that of a corresponding normal NHP allele, the mutation (s) 
responsible for the loss or alteration of function of the mutant 

10 NHP gene product can be ascertained. 

Alternatively, a genomic library can be constructed using DNA 
obtained from an individual suspected of or known to carry a 
mutant NHP allele (e.g., a person manifesting a NHP-associated 
phenotype such as, for example, obesity, high blood pressure, 

15 connective tissue disorders, infertility, etc.), or a cDNA library 
can be constructed using RNA from a tissue known, or suspected, to 
express a mutant NHP allele. A normal NHP sequence, or any 
suitable fragment thereof, can then be labeled and used as a probe 
to identify the corresponding mutant NHP allele in such libraries. 

20 Clones containing mutant NHP sequences can then be purified and 

subjected to sequence analysis according to methods well known to 
those skilled in the art. 

Additionally, an expression library can be constructed 
utilizing cDNA synthesized from, for example, RNA isolated from a 

25 tissue known, or suspected, to express a mutant NHP allele in an 
individual suspected of or known to carry such a mutant allele. 
In this manner, gene products made by the putatively mutant tissue 
can be expressed and screened using standard antibody screening 
techniques in conjunction with antibodies raised against normal 

30 NHP product, as described below. (For screening techniques, see, 
for example, Harlow, E. and Lane, eds . , 1988, "Antibodies: A 
Laboratory Manual", Cold Spring Harbor Press, Cold Spring Harbor.) 



12 



LEX-0118-USA 



Exhibit B 

Page 1 5 of 43 



Additionally, screening can be accomplished by screening with 
labeled NHP fusion proteins, such as, for example, alkaline 
phosphatase-NHP or NHP-alkaline phosphatase fusion proteins. 
In cases where a NHP mutation results in an expressed gene product 
5 with altered function (e.g., as a result of a missense or a 

frameshift mutation) , polyclonal antibodies to NHP are likely to 
cross-react with a corresponding mutant NHP gene product. Library 
clones detected via their reaction with such labeled antibodies 
can be purified and subjected to sequence analysis according to 

10 methods well known in the art. 

The invention also encompasses (a) DNA vectors that contain 
any of the foregoing NHP coding sequences and/or their complements 
(i.e., antisense) ; (b) DNA expression vectors that contain any of 
the foregoing NHP coding sequences operatively associated with a 

15 regulatory element that directs the expression of the coding 

sequences (for example, baculo virus as described in U.S. Patent 
No. 5,869,336 herein incorporated by reference); (c) genetically 
engineered host cells that contain any of the foregoing NHP coding 
sequences operatively associated with a regulatory element that 

20 directs the expression of the coding sequences in the host cell; 
and (d) genetically engineered host cells that express an 
endogenous NHP sequence under the control of an exogenously 
introduced regulatory element (i.e., gene activation). As used 
herein, regulatory elements include, but are not limited to, 

2 5 inducible and non- inducible promoters, enhancers, operators and 
other elements known to those skilled in the art that drive and 
regulate expression. Such regulatory elements include but are not 
limited to the cytomegalovirus (hCMV) immediate early gene, 
regulatable, viral elements (particularly retroviral LTR 

30 promoters) , the early or late promoters of SV40 adenovirus, the 
lac system, the trp system, the TAC system, the TRC system, the 
major operator and promoter regions of phage lambda, the control 
regions of fd coat protein, the promoter for 3-phosphoglycerate 
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kinase (PGK) , the promoters of acid phosphatase, and the promoters 
of the yeast a-mating factors. 

The present invention also encompasses antibodies and anti- 
idiotypic antibodies (including Fab fragments), antagonists and 
5 agonists of a NHP, as well as compounds or nucleotide constructs 
that inhibit expression of a NHP gene (transcription factor 
inhibitors, antisense and ribozyme molecules, or gene or 
regulatory sequence replacement constructs), or. promote the 
expression of a NHP {e.g., expression constructs in which NHP 

10 coding sequences are operatively associated with expression 

control elements such as promoters, promoter /enhancers , etc.). 

The NHPs or NHP peptides, NHP fusion proteins, NHP nucleotide 
sequences, antibodies, antagonists and agonists can be useful for 
the detection of mutant NHPs or inappropriately expressed NHPs for 

15 the diagnosis of disease. The NHP proteins or peptides, NHP 

fusion proteins, NHP nucleotide sequences, host cell expression 
systems, antibodies, antagonists, agonists and genetically 
engineered cells and animals can be used for screening for drugs 
(or high throughput screening of combinatorial libraries) 

20 effective in the treatment of the symptomatic or phenotypic 

manifestations of perturbing the normal function of a NHP in the 
body. The use of engineered host cells and/or animals may offer 
an advantage in that such systems allow not only for the 
identification of compounds that bind to the endogenous receptor 

25 for a NHP, but can also identify compounds that trigger NHP- 
mediated activities or pathways. 

Finally, the NHP products can be used as therapeutics. For 
example, soluble derivatives such as NHP peptides/domains 
corresponding to NHP, NHP fusion protein products (especially NHP- 

30 Ig fusion proteins, i.e., fusions of a NHP, or a domain of a NHP, 
to an IgFc) , NHP antibodies and anti-idiotypic antibodies 
(including Fab fragments) , antagonists or agonists (including 
compounds that modulate or act on downstream targets in a NHP- 
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mediated pathway) can be used to directly treat diseases or 
disorders. For instance, the administration of an effective 
amount of soluble NHP , or a NHP-IgFc fusion protein or an anti- 
idiotypic antibody (or its Fab) that mimics the NHP could activate 
5 or effectively antagonize the endogenous NHP receptor. Nucleotide 
constructs encoding such NHP products can be used to genetically 
engineer host cells to express such products in vivo; these 
genetically engineered cells function as "bioreactors " in the body 
delivering a continuous supply of a NHP , a NHP peptide, or a NHP 

10 fusion protein to the body. Nucleotide constructs encoding 

functional NHP, mutant NHPs, as well as antisense and ribozyme 
molecules can also be used in "gene therapy" approaches for the 
modulation of NHP expression. Thus, the invention also 
encompasses pharmaceutical formulations and methods for treating 

15 biological disorders. 

Various aspects of the invention are described in greater 
detail in the subsections below. 

5 . 1 THE NHP SEQUENCES 
2 0 The cDNA sequences and the corresponding deduced amino acid 

sequences of the described NHP are presented in the Sequence 
Listing. SEQ ID N0S:15 describes the NHP ORFs as well as flanking 
regions. The NHP nucleotides were obtained from human cDNA 
libraries using probes and/or primers generated from human gene 

2 5 trapped sequence tags. Expression analysis has provided evidence 

that the described NHP can be expressed in a variety of human 
cells as well as gene trapped human cells. 

5 . 2 NHP AND NHP POLYPEPTIDES 

3 0 NHPs, polypeptides, peptide fragments, mutated, truncated, or 

deleted forms of the NHPs, and/or NHP fusion proteins can be 
prepared for a variety of uses. These uses include, but are not 
limited to, the generation of antibodies, as reagents in 
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diagnostic assays, for the identification of other cellular gene 
products related to a NHP , as reagents in assays for screening for 
compounds that can be as pharmaceutical reagents useful in the 
therapeutic treatment of mental, biological, or medical disorders 
5 and disease. 

The Sequence Listing discloses the amino acid sequence 
encoded by the described NHP polynucleotides. The NHPs display 
initiator methionines in DNA sequence contexts consistent with 
translation initiation sites, and apparently display signal 

10 sequences which can indicate that the described NHP ORFs are 
secreted proteins or possibly membrane associated. 

The NHP amino acid sequences of the invention include the 
amino acid sequences presented in the Sequence Listing as well as 
analogues and derivatives thereof, as well as any oligopeptide 

15 sequence of at least about 10-40, generally about 12-35, or about 
16-30 amino acids in length first disclosed in the Sequence 
Listing. Further, corresponding NHP homologues from other species 
are encompassed by the invention. In fact, any NHP encoded by the 
NHP nucleotide sequences described above are within the scope of 

20 the invention, as are any novel polynucleotide sequences encoding 
all or any novel portion of an amino acid sequence presented in 
the Sequence Listing. The degenerate nature of the genetic code 
is well known, and, accordingly, each amino acid presented in the 
Sequence Listing, is generically representative of the well known 

25 nucleic acid "triplet" codon, or in many cases codons, that can 

encode the amino acid. As such, as contemplated herein, the amino 
acid sequences presented in the Sequence Listing, when taken 
together with the genetic code (see, for example, Table 4-1 at 
page 109 of "Molecular Cell Biology", 1986, J. Darnell efc al . 

30 eds . , Scientific American Books, New York, NY, herein incorporated 
by reference) are generically representative of all the various 
permutations and combinations of nucleic acid sequences that can 
encode such amino acid sequences. 
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The invention also encompasses proteins that are functionally 
equivalent to the NHPs encoded by the presently described 
nucleotide sequences as judged by any of a number of criteria, 
including, but not limited to, the ability to bind and cleave a 
5 substrate of a NHP, or the ability to effect an identical or 
complementary downstream pathway, or a change in cellular 
metabolism (e.g., proteolytic activity, ion flux, tyrosine 
phosphorylation, etc.). Such functionally equivalent NHP proteins 
include, but are not limited to, additions or substitutions of 

10 amino acid residues within the amino acid sequence encoded by the 
NHP nucleotide sequences described above, but which result in a 
silent change, thus producing a functionally equivalent gene 
product. Amino acid substitutions can be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity , 

15 hydrophilicity , and/or the amphipathic nature of the residues 

involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, and 

20 glutamine; positively charged (basic) amino acids include 

arginine, lysine, and histidine; and negatively charged (acidic) 
amino acids include aspartic acid and glutamic acid. 

A variety of host-expression vector systems can be used to 
express the NHP nucleotide sequences of the invention. Where, as 

25 in the present instance, the NHP products or NHP polypeptides can 
be produced in soluble or secreted forms (by removing one or more 
transmembrane domains where applicable) , the peptide or 
polypeptide can be recovered from the culture media. Such 
expression systems also encompass engineered host cells that 

30 express a NHP, or a functional equivalent, in situ. Purification 
or enrichment of NHP from such expression systems can be 
accomplished using appropriate detergents and lipid micelles and 
methods well known to those skilled in the art. However, such 
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engineered host cells themselves may be used in situations where 
it is important not only to retain the structural and functional 
characteristics of the NHP , but to assess biological activity, 
e.g., in drug screening assays. 
5 The expression systems that may be used for purposes of the 

invention include but are not limited to microorganisms such as 
bacteria (e.g., E. coli, B. subtilis) transformed with recombinant 
bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors 
containing NHP nucleotide sequences; yeast (e.g., Saccharomyces , 

10 Pichia) transformed with recombinant yeast expression vectors 

containing NHP encoding nucleotide sequences; insect cell systems 
infected with recombinant virus expression vectors (e.g., 
baculovirus) containing NHP sequences; plant cell systems infected 
with recombinant virus expression vectors [e.g., cauliflower 

15 mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with 
recombinant plasmid expression vectors (e.g., Ti plasmid) 
containing NHP nucleotide sequences; or mammalian cell systems 
(e.g., COS, CHO, BHK, 293, 3T3) harboring recombinant expression 
constructs containing promoters derived from the genome of 

20 mammalian cells (e.g., metallothionein promoter) or from mammalian 
viruses (e.g., the adenovirus late promoter; the vaccinia virus 
7 . 5K promoter) . 

In bacterial systems, a number of expression vectors may be 
advantageously selected depending upon the use intended for the 

2 5 NHP product being expressed. For example, when a large quantity 
of such a protein is to be produced for the generation of 
pharmaceutical compositions of or containing NHP, or for raising 
antibodies to a NHP, vectors that direct the expression of high 
levels of fusion protein products that are readily purified may be 

30 desirable. Such vectors include, but are not limited, to the E. 
coli expression vector pUR278 (Ruther et al . , 1983, EMBO J. 
2:1791), in which a NHP coding sequence may be ligated 
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individually into the vector in frame with the lacZ coding region 
so that a fusion protein is produced; pIN vectors (Inouye & 
Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van Heeke & 
Schuster, 1989, J. Biol. Chem. 254:5503-5509); and the like. pGEX 
5 vectors (Pharmacia or American Type Culture Collection) can also 
be used to express foreign polypeptides as fusion proteins with 
glutathione S- transferase (GST) . In general, such fusion proteins 
are soluble and can easily be purified from lysed cells by 
adsorption to glutathione-agarose beads followed by elution in the 

10 presence of free glutathione. The PGEX vectors are designed to 
include thrombin or factor Xa protease cleavage sites so that the 
cloned target gene product can be released from the GST moiety. 

In an insect system, Autographa calif ornica nuclear 
polyhidrosis virus (AcNPV) is used as a vector to express foreign 

15 sequences. The virus grows in Spodoptera frugiperda cells. A NHP 
coding sequence can be cloned individually into non-essential 
regions (for example the polyhedrin gene) of the virus and placed 
under control of an AcNPV promoter (for example the polyhedrin 
promoter) . Successful insertion of NHP coding sequence will 

2 0 result in inactivation of the polyhedrin gene and production of 

non-occluded recombinant virus (i.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene) . These 
recombinant viruses are then used to infect Spodoptera frugiperda 
cells in which the inserted sequence is expressed (e.g., see 
25 Smith et al . , 1983, J. Virol. 46: 584; Smith, U.S. Patent No. 
4, 215, 051) . 

In mammalian host cells, a number of viral-based expression 
systems may be utilized. In cases where an adenovirus is used as 
an expression vector, the NHP nucleotide sequence of interest may 

3 0 be ligated to an adenovirus transcription/ translation control 

complex, e.g., the late promoter and tripartite leader sequence. 
This chimeric sequence may then be inserted in the adenovirus 
genome by in -vitro or in vivo recombination. Insertion in a non- 
19 LEX- 01 18 -USA 
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essential region of the viral genome (e.g., region El or E3) will 
result in a recombinant virus that is viable and capable of 
expressing a NHP product in infected hosts (e.g., See Logan & 
Shenk, 1984, Proc . Natl. Acad. Sci . USA 81 : 3655-3 659 ) . Specific 
initiation signals may also be required for efficient translation 
of inserted NHP nucleotide sequences. These signals include the 
ATG initiation codon and adjacent sequences. In cases where an 
entire NHP sequence or cDNA, including its own initiation codon 
and adjacent sequences, is inserted into the appropriate 
expression vector, no additional translational control signals may 
be needed. However, in cases where only a portion of a NHP coding 
sequence is inserted, exogenous translational control signals, 
including, perhaps, the ATG initiation codon, must be provided. 
Furthermore, the initiation codon must be in phase with the 
reading frame of the desired coding sequence to ensure translation 
of the entire insert. These exogenous translational control 
signals and initiation codons can be of a variety of origins, both 
natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of appropriate transcription enhancer 
elements, transcription terminators, etc. (See Bittner et al . , 
1987, Methods in Enzymol . 153:516-544). 

In addition, a host cell strain may be chosen that modulates 
the expression of the inserted sequences, or modifies and 
processes the gene product in the specific fashion desired. Such 
modifications (e.g., glycosylation) and processing (e.g., 
cleavage) of protein products may be important for the function of 
the protein. Different host cells have characteristic and 
specific mechanisms for the post- translational processing and 
modification of proteins and gene products. Appropriate cell 
lines or host systems can be chosen to ensure the correct 
modification and processing of the foreign protein expressed. To 
this end, eukaryotic host cells which possess the cellular 
machinery for proper processing of the primary transcript, 
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glycosylation, and phosphorylation of the gene product may be 
used. Such mammalian host cells include, but are not limited to, 
CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3 , WI38, and in 
particular, human cell lines. 
5 For long-term, high-yield production of recombinant proteins, 

stable expression is preferred. For example, cell lines which 
stably express the NHP sequences described above can be 
engineered. Rather than using expression vectors which contain 
viral origins of replication, host cells can be transformed with 

10 DNA controlled by appropriate expression control elements (e.g., 
promoter, enhancer sequences, transcription terminators, 
polyadenylation sites, etc.), and a selectable marker. Following 
the introduction of the foreign DNA, engineered cells may be 
allowed to grow for 1-2 days in an enriched media, and then are 

15 switched to a selective media. The selectable marker in the 

recombinant plasmid confers resistance to the selection and allows 
cells to stably integrate the plasmid into their chromosomes and 
grow to form foci which in turn can be cloned and expanded into 
cell lines. This method may advantageously be used to engineer 

20 cell lines which express the NHP product. Such engineered cell 
lines may be particularly useful in screening and evaluation of 
compounds that affect the endogenous activity of the NHP product. 

A number of selection systems may be used, including but not 
limited to the herpes simplex virus thymidine kinase (Wigler, et 

25 al . , 1971, Cell 11:223), hypoxanthine-guanine 

phosphoribosyltransf erase (Szybalska & Szybalski, 1962, Proc . 
Natl. Acad. Sci . USA 48:2026), and adenine 

phosphoribosyltransf erase (Lowy, et al . , 1980, Cell 22:817) genes 
can be employed in tk~, hgprt" or aprt" cells, respectively. Also, 
30 antimetabolite resistance can be used as the basis of selection 
for the following genes: dhfr, which confers resistance to 
methotrexate (Wigler, et al . , 1980, Natl. Acad. Sci. USA 77:3567; 
O'Hare, et al . , 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, 
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which confers resistance to mycophenolic acid (Mulligan & Berg, 
1981, Proc. Natl. Acad. Sci . USA 78:2012); neo, which confers 
resistance to the aminoglycoside G-418 (Colberre-Garapin, efc al . , 
1981, J. Mol . Biol. 150:1); and hygro, which confers resistance to 
5 hygromycin (Santerre, et al . , 1984, Gene 30:147). 

Alternatively, any fusion protein can be readily purified by 
utilizing an antibody specific for the fusion protein being 
expressed. For example, a system described by Janknecht et al . 
allows for the ready purification of non-denatured fusion proteins 

10 expressed in human cell lines (Janknecht, et al . , 1991, Proc. 

Natl. Acad. Sci. USA 88:8972-8976). In this system, the gene of 
interest is subcloned into a vaccinia recombination plasmid such 
that the gene's open reading frame is translationally fused to an 
amino- terminal tag consisting of six histidine residues. Extracts 

15 from cells infected with recombinant vaccinia virus are loaded 

onto Ni 2+ ■ nitriloacetic acid-agarose columns and histidine- tagged 
proteins are selectively eluted with imidazole-containing buffers. 

Also encompassed by the present invention are fusion proteins 
that direct the NHP to a target organ and/or facilitate transport 

20 across the membrane into the cytosol. Conjugation of NHPs to 

antibody molecules or their Fab fragments could be used to target 
cells bearing a particular epitope. Attaching the appropriate 
signal sequence to the NHP would also transport the NHP to the 
desired location within the cell. Alternatively targeting of NHP 

25 or its nucleic acid sequence might be achieved using liposome or 
lipid complex based delivery systems. Such technologies are 
described in Liposomes: A Practical Approach , New , RRC ed. , Oxford 
University Press, New York and in U.S. Patents Nos . 4,594,595, 
5,459,127, 5,948,767 and 6,110,490 and their respective 

3 0 disclosures which are herein incorporated by reference in their 
entirety. Additionally embodied are novel protein constructs 
engineered in such a way that they facilitate transport of the NHP 
to the target site or desired organ, where they cross the cell 
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membrane and/or the nucleus where the NHP can exert its functional 
activity. This goal may be achieved by coupling of the NHP to a 
cytokine or other ligand that provides targeting specificity, 
and/or to a protein transducing domain (see generally U.S. 
5 applications Ser. No. 60/111,701 and 60/056,713, both of which are 
herein incorporated by reference, for examples of such transducing 
sequences) to facilitate passage across cellular membranes and can 
optionally be engineered to include nuclear localization 
sequences . 

10 5.3 ANTIBODIES TO NHP PRODUCTS 

Antibodies that specifically recognize one or more epitopes 
of a NHP, or epitopes of conserved variants of a NHP, or peptide 
fragments of a NHP are also encompassed by the invention. Such 
antibodies include but are not limited to polyclonal antibodies, 

15 monoclonal antibodies (mAbs), humanized or chimeric antibodies, 
single chain antibodies, Fab fragments, F(ab') 2 fragments, 
fragments produced by a Fab expression library, anti-idiotypic 
(anti-Id) antibodies, and epi tope-binding fragments of any of the 
above . 

2 0 The antibodies of the invention may be used, for example, in 

the detection of NHP in a biological sample and may, therefore, be 
utilized as part of a diagnostic or prognostic technique whereby 
patients may be tested for abnormal amounts of NHP. Such 
antibodies may also be utilized in conjunction with, for example, 

25 compound screening schemes for the evaluation of the effect of 
test compounds on expression and/or activity of a NHP gene 
product. Additionally, such antibodies can be used in conjunction 
gene therapy to, for example, evaluate the normal and/or 
engineered NHP-expressing cells prior to their introduction into 

30 the patient. Such antibodies may additionally be used as a method 
for the inhibition of abnormal NHP activity. Thus, such 
antibodies may, therefore, be utilized as part of treatment 
methods . 
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For the production of antibodies, various host animals may be 
immunized by injection with the NHP, an NHP peptide (e.g., one 
corresponding to a functional domain of an NHP) , truncated NHP 
polypeptides (NHP in which one or more domains have been deleted) , 
5 functional equivalents, of the NHP or mutated variant of the NHP. 
Such host animals may include but are not limited to pigs, 
rabbits, mice, goats, and rats, to name but a few. Various 
adjuvants may be used to increase the immunological response, 
depending on the host species, including but not limited to 

10 Freund's adjuvant (complete and incomplete), mineral salts such as 
aluminum hydroxide or aluminum phosphate, surface active 
substances such as lysoleci thin, pluronic polyols, polyanions, 
peptides, oil emulsions, and potentially useful human adjuvants 
such as BCG (bacille Calmette-Guerin) and Corynebacterium parwm. 

15 Alternatively, the immune response could be enhanced by 

combination and or coupling with molecules such as keyhole limpet 
hemocyanin, tetanus toxoid, diptheria toxoid, ovalbumin, cholera 
toxin or fragments thereof. Polyclonal antibodies are 
heterogeneous populations of antibody molecules derived from the 

20 sera of the immunized animals. 

Monoclonal antibodies, which are homogeneous populations of 
antibodies to a particular antigen, can be obtained by any 
technique which provides for the production of antibody molecules 
by continuous cell lines in culture. These include, but are not 

25 limited to, the hybridoma technique of Kohler and Milstein, (1975, 
Nature 256:495-497; and U.S. Patent No. 4,376,110), the human B- 
cell hybridoma technique (Kosbor et al . , 1983, Immunology Today 
4:72; Cole et al . , 1983, Proc . Natl. Acad. Sci . USA 80:2026-2030), 
and the EBV-hybridoma technique (Cole et al . , 1985, Monoclonal 

30 Antibodies And Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Such antibodies may be of any immunoglobulin class including IgG, 
IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma 
producing the mAb of this invention may be cultivated in vitro or 
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in vivo. Production of high titers of mAbs in vivo makes this the 
presently preferred method of production. 

In addition, techniques developed for the production of 
"chimeric antibodies" (Morrison et al . , 1984, Proc . Natl. Acad. 
5 Sci., 81:6851-6855; Neuberger et al . , 1984, Nature, 312:604-608; 
Takeda et al . , 1985, Nature, 314:452-454) by splicing the genes 
from a mouse antibody molecule of appropriate antigen specificity 
together with genes from a human antibody molecule of appropriate 
biological activity can be used. A chimeric antibody is a 

10 molecule in which different portions are derived from different 
animal species, such as those having a variable region derived 
from a murine mAb and a human immunoglobulin constant region. Such 
technologies are described in U.S. Patents Nos. 6,075,181 and 
5,877,397 and their respective disclosures which are herein 

15 incorporated by reference in their entirety. Also favored is the 
production of fully humanized monoclonal antibodies as described 
in US Patent No. 6,150,584 and respective disclosures which are 
herein incorporated by reference in their entirety. 

Alternatively, techniques described for the production of 

20 single chain antibodies (U.S. Patent 4,946,778; Bird, 1988, 

Science 242:423-426; Huston et al . , 1988, Proc. Natl. Acad. Sci. 
USA 85:5879-5883; and Ward et al . , 1989, Nature 334:544-546) can 
be adapted to produce single chain antibodies against NHP gene 
products. Single chain antibodies are formed by linking the heavy 

2 5 and light chain fragments of the Fv region via an amino acid 

bridge, resulting in a single chain polypeptide. 

Antibody fragments which recognize specific epitopes may be 
generated by known techniques. For example, such fragments 
include, but are not limited to: the F(ab') 2 fragments which can 

3 0 be produced by pepsin digestion of the antibody molecule and the 

Fab fragments which can be generated by reducing the disulfide 
bridges of the F(ab') 2 fragments. Alternatively, Fab expression 
libraries may be constructed (Huse et al . , 1989, Science, 
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246:1275-1281) to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. 

Antibodies to a NHP can, in turn, be utilized to generate 
anti-idiotype antibodies that "mimic" a given NHP, using 
techniques well known to those skilled in the art. (See, e.g., 
Greenspan & Bona, 1993, FASEB J 7 (5) : 437 -444 ; and Nissinoff, 1991, 
J. Immunol. 147 (8) : 2429-2438) . For example antibodies which bind 
to a NHP domain and competitively inhibit the binding of NHP to 
its cognate receptor can be used to generate anti-idiotypes that 
"mimic" the NHP and, therefore, bind and activate or neutralize a 
receptor. Such anti-idiotypic antibodies or Fab fragments of such 
anti-idiotypes can be used in therapeutic regimens involving a NHP 
signaling pathway. 

The present invention is not to be limited in scope by the 
specific embodiments described herein, which are intended as 
single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the 
scope of the invention. Indeed, various modifications of the 
invention, in addition to those shown and described herein will 
become apparent to those skilled in the art from the foregoing 
description. Such modifications are intended to fall within the 
scope of the appended claims. All cited publications, patents, 
and patent applications are herein incorporated by reference in 
their entirety. 
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WHAT IS CLAIMED IS : 

1. An isolated nucleic acid molecule comprising at 
least 24 contiguous bases of nucleotide sequence first disclosed 

5 in SEQ ID NO: 1. 

2. An isolated nucleic acid molecule comprising a 
nucleotide sequence that: 

(a) encodes the amino acid sequence shown in SEQ ID 
10 NO: 2; and 

(b) hybridizes under stringent conditions to the 
nucleotide sequence of SEQ ID NO: 1 or the 
complement thereof . 

15 3 . An isolated nucleic acid molecule encoding the 

amino acid sequence described in SEQ ID NO: 2. 

4. An isolated oligopeptide comprising at least about 
12 amino acids in a sequence first disclosed in SEQ ID NO : 2 . 

20 
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ABSTRACT 

Novel human polynucleotide and polypeptide sequences are 
disclosed that can be used in therapeutic, diagnostic, and 
pharmacogenomic applications. 
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DECLARATION AND POWER OF ATTORNEY ATTORNEY DOCKET NO . LEX-0 1 IS-VSAS 

FOR PATENT APPLICATION V - ' : - •'■ " ' ■ . V ':'f 

As a below named inventor, I hereby declare that: 

My residence/post office address and citizenship are as stated below next to my name; 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plural names 

are listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled: 

Novel Human Enzymes and Polynucleotides Encoding the Same 

the specification of which is attached hereto unless the following box is checked: 

( ) was filed on as US Application Serial No. or PCT International Application 

Number and was amended on (if applicable). 

I hereby state that I have reviewed and understood the contents of the above-identified specification, including the claims, as amended 
by any amendment(s) referred to above. I acknowledge the duty to disclose all information which is material to patentability as defined 
in 37 CFR 1.56. 

Foreign Application(s) and/or Claim of Foreign Priority 

I hereby claim foreign priority benefits under Title 35, United States Code Section 119 of any foreign application(s) for patent or inventor(s) certificate listed 
below and have also identified below any foreign application for patent or inventor(s) certificate having a filing date before that of the application on which 
priority is claimed: 



COUNTRY 


APPLICATION NUMBER 


DATE FILED 


PRIORITY CLAIMED UNDER 35 I J.S.C. 119 








YES: NO: 








VR<!- NO- 



Provisional Application 

I hereby claim the benefit under Title 35, United States Code Section 1 19(e) of any United States provisional application(s) listed below: 



APPLICATION SERIAL NUMBER 


FILING DATE 


60/179,000 


1/28/2000 







U.S. Priority Claim 

I hereby claim the benefit under Title 35, United States Code, Section 120 of any United States application(s) listed below and, insofar as the subject matter 
of each of the claims of this application is not disclosed in the prior United States application in the manner provided by the first paragraph of Title 35, United 
States Code Section 112, I acknowledge the duty to disclose material information as defined in Title 37, Code of Federal Regulations, Section 1.56(a) which 
^occurred between the filing date of the prior application and the national or PCT international filing date of this application: 



APPLICATION SERIAL NUMBER 


FILING DATE 


STATUS(patentea7pending/abandoned) 





















POWER OF ATTORNEY: 

As a named inventor, I hereby appoint the following attorney(s) and/or agent(s) listed below to prosecute this application and transact all business in the Patent 
and Trademark Office connected therewith. 



Lance K. Ishimoto, Reg. No. 41866 



Send Correspondence to: 


Direct Telephone Calls To: 




' •V: ; 


Lance K. Ishimoto 
Lexicon Genetics Incorporated 
4000 Research Forest Drive 
The Woodlands, TX 77381 


Lance K. Ishimoto 
(281) 863-3333 




- '-^^ 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are believed to be true; 
and further that these statements were made with the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or 
both, under Section 1001 of Title 18 of the United States Code and that such willful false statements may jeopardize the validity of the application or any 
patent issued thereon. 
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iNEY DOCKET NO . LEX-01 1 8-1 JS A * 



Full Name of Inventor: Gregory Donoho 



Citizenship: USA 



Residence: 95 Autumn Branch Drive, The Woodlands, TX 77382 



Post Office Address: Same 



Inventor's Signature 



Date 



Full Name of Inventor: Erin Hilbun 



Citizenship: USA 



Residence: 16222 Stuebner Airline, Spring, TX 77379 



Post Office Address: Same 



Inventor's Signature 



Date 



Full Name of Inventor: John Scoville 



Citizenship: USA 



Residence: 8222 Kingsbrook Dr. #543, Houston, TX 77024 



Post Office Address: Same 



Inventor's Signature 



Date 



Full Name of Inventor: C. Alexander Turner, Jr. 



Citizenship: USA 



Residence: 67 Winter Wheat Place, The Woodlands, TX 77381 



Post Office Address: Same 



Inventor's Signature 



Date 



Full Name of Inventor: Glenn Friedrich 



Citizenship: Canada 



Residence: c/o Breland & Breland, Houston, TX 77004 



Post Office Address: Same 



Inventor's Signature 



Date 



Full Name of Inventor: Alejandro Abuin 



Citizenship: Spain 



Residence: 19 Belcarra Place, The Woodlands, TX 77382 



Post Office Address: Same 



Inventor's Signature 



Date 
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ATTORNEY DOCKET NO. LEX-0118-USA 



Full Name of Inventor: Brian Zambrowicz Citizenship: USA 

Residence: 18 Firethorne Place, The Woodlands, TX 77382 

Post Office Address: Same 



Inventor's Signature Date 

Full Name of Inventor: Arthur T. Sands Citizenship: USA 

Residence: 163 Bristol Bend Circle, The Woodlands, TX 77382 

Post Office Address: Same 



Inventor's Signature Date 
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^ftec'tfPCT/PTO 2 7 FEB 2002 



SEQUENCE LISTING 

<110> Donoho, Gregory 
Hilbun, Erin 
Scoville, John 
Turner, C. Alexander Jr. 
Friedrich, Glenn 
Abuin, Alejandro 
Zambrowicz , Brian 
Sands, Arthur T. 

<120> Novel Human Enzymes and Polynucleotides 
Encoding the Same 

<130> LEX-0118-USA 

<150> US 60/179, 000 
<151> 2000-01-28 

<160> 15 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 831 

<212> DNA 

<213> Homo sapiens 

<400> 1 

atgacctctt tccgcttggc 
gtcactcgcg cttgtagctt 
ttgccggaat gctttaattc 
attcctggtg aatccacaca 
attggaggct ctatccctga 
gggcctgatg gaactttact 
cctggaaaaa ttacatttca 
tttgatactc cttactgcag 
cttgcacaaa tctacgcaca 
ctgaccactg gaccagccca 
gtgtatgtgg ccacagcctc 
cacagcaccg tggtgaaccc 
atcgtgtatt cagacataga 
tttagacaga agcgatcaga 

<210> 2 
<211> 276 
<212> PRT 
< 2 1 3 > Homo sapiens 

<400> 2 

Met Thr Ser Phe Arg Leu Ala Leu He Gin Leu Gin He Ser Ser He 

15 10 15 

Lys Ser Asp Asn Val Thr Arg Ala Cys Ser Phe He Arg Glu Ala Ala 

20 25 30 

Thr Gin Gly Ala Lys He Val Ser Leu Pro Glu Cys Phe Asn Ser Pro 

1 



cctcatccag cttcagattt cttccatcaa atcagataac 60 

catccgggag gcagcaacgc aaggagccaa aatagtttct 120 

tccatatgga gcgaaatatt ttcctgaata tgcagagaaa 180 

gaagctttct gaagtagcaa aggaatgcag catatatctc 240 

agaggatgct gggaaattat ataacacctg tgctgtgttt 300 

agcaaagtat agaaagatcc atctgtttga cattgatgtt 360 

agaatctaaa acattgagtc cgggtgatag tttctccaca 420 

agtgggtctg ggcatctgct acgacatgcg gtttgcagag 480 

gagaggctgc cagctgttgg tatatccagg agcttttaat 540 

ttgggagtta cttcagcgaa gccgggctgt tgataatcag 600 

tcctgcccgg gatgacaaag cctcctatgt tgcctgggga 660 

ttggggggag gttctagcca aagctggcac agaagaagca 720 

cctgaagaag ctggctgaaa tacgccagca aatccccgtt 780 

cctctatgct gtggagatga aaaagcccta a 831 
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35 

Tyr Gly Ala Lys 
50 

Ser Thr Gin Lys 
65 

lie Gly Gly Ser 

Cys Ala Val Phe 
100 

lie His Leu Phe 
115 

Ser Lys Thr Leu 
130 

Tyr Cys Arg Val 
145 

Leu Ala Gin lie 

Gly Ala Phe Asn 
180 

Arg Ser Arg Ala 
195 

Ala Arg Asp Asp 
210 

Val Asn Pro Trp 
225 

lie Val Tyr Ser 

Gin He Pro Val 
260 

Met Lys Lys Pro 
275 



40 

Tyr Phe Pro Glu 
55 

Leu Ser Glu Val 
70 

He Pro Glu Glu 
85 

Gly Pro Asp Gly 

Asp He Asp Val 
120 

Ser Pro Gly Asp 
135 

Gly Leu Gly He 
150 

Tyr Ala Gin Arg 
165 

Leu Thr Thr Gly 

Val Asp Asn Gin 
200 

Lys Ala Ser Tyr 
215 

Gly Glu Val Leu 
230 

Asp He Asp Leu 
245 

Phe Arg Gin Lys 



Tyr Ala Glu Lys 
60 

Ala Lys Glu Cys 
75 

Asp Ala Gly Lys 
90 

Thr Leu Leu Ala 
105 

Pro Gly Lys He 

Ser Phe Ser Thr 
140 

Cys Tyr Asp Met 
155 

Gly Cys Gin Leu 
170 

Pro Ala His Trp 
185 

Val Tyr Val Ala 

Val Ala Trp Gly 
220 

Ala Lys Ala Gly 

235 

Lys Lys Leu Ala 
250 

Arg Ser Asp Leu 

265 



45 

He Pro Gly Glu 

Ser He Tyr Leu 
80 

Leu Tyr Asn Thr 
95 

Lys Tyr Arg Lys 
110 

Thr Phe Gin Glu 
125 

Phe Asp Thr Pro 

Arg Phe Ala Glu 
160 

Leu Val Tyr Pro 
175 

Glu Leu Leu Gin 
190 

Thr Ala Ser Pro 
205 

His Ser Thr Val 

Thr Glu Glu Ala 
240 

Glu He Arg Gin 
255 

Tyr Ala Val Glu 
270 



<210> 3 
<211> 480 
<212> DNA 

<213> Homo sapiens 



<400> 3 

atgtcatgga ggatttcccc tgccacacca tgctgtaggg agttaacttt tcatttgtgc 60 

attttctgtt tggaaacagc ttactgcaga gtgggtctgg gcatctgcta cgacatgcgg 120 

tttgcagagc ttgcacaaat ctacgcacag agaggctgcc agctgttggt atatccagga 180 

gcttttaatc tgaccactgg accagcccat tgggagttac ttcagcgaag ccgggctgtt 240 

gataatcagg tgtatgtggc cacagcctct cctgcccggg atgacaaagc ctcctatgtt 300 

gcctggggac acagcaccgt ggtgaaccct tggggggagg ttctagccaa agctggcaca 360 

gaagaagcaa tcgtgtattc agacatagac ctgaagaagc tggctgaaat acgccagcaa 420 

atccccgttt ttagacagaa gcgatcagac ctctatgctg tggagatgaa aaagccctaa 480 



<210> 4 
<211> 159 
<212> PRT 

<213> Homo sapiens 



<400> 4 

Met Ser Trp Arg He Ser Pro Ala Thr Pro Cys Cys Arg Glu Leu Thr 

1 5 io 15 

Phe His Leu Cys He Phe Cys Leu Glu Thr Ala Tyr Cys Arg Val Gly 



2 
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20 25 30 

Leu Gly lie Cys Tyr Asp Met Arg Phe Ala Glu Leu Ala Gin lie Tyr 

35 40 45 

Ala Gin Arg Gly Cys Gin Leu Leu Val Tyr Pro Gly Ala Phe Asn Leu 

50 55 60 

Thr Thr Gly Pro Ala His Trp Glu Leu Leu Gin Arg Ser Arg Ala Val 
65 70 75 80 

Asp Asn Gin Val Tyr Val Ala Thr Ala Ser Pro Ala Arg Asp Asp Lys 

85 90 95 

Ala Ser Tyr Val Ala Trp Gly His Ser Thr Val Val Asn Pro Trp Gly 

100 105 110 

Glu Val Leu Ala Lys Ala Gly Thr Glu Glu Ala lie Val Tyr Ser Asp 

115 120 125 

lie Asp Leu Lys Lys Leu Ala Glu lie Arg Gin Gin lie Pro Val Phe 

130 135 140 

Arg Gin Lys Arg Ser Asp Leu Tyr Ala Val Glu Met Lys Lys Pro 
145 150 155 



<210> 5 

<211> 366 

<212> DNA 

<213> Homo sapiens 



<400> 5 

atgcggtttg cagagcttgc acaaatctac gcacagagag gctgccagct gttggtatat 60 

ccaggagctt ttaatctgac cactggacca gcccattggg agttacttca gcgaagccgg 120 

gctgttgata atcaggtgta tgtggccaca gcctctcctg cccgggatga caaagcctcc 180 

tatgttgcct ggggacacag caccgtggtg aacccttggg gggaggttct agccaaagct 240 

ggcacagaag aagcaatcgt gtattcagac atagacctga agaagctggc tgaaatacgc 3 00 

cagcaaatcc ccgtttttag acagaagcga tcagacctct atgctgtgga gatgaaaaag 3 60 

ccctaa 366 



<210> 6 
<211> 121 
<212> PRT 

<213> Homo sapiens 



<400> 6 

Met Arg Phe Ala 
1 

Leu Leu Val Tyr 
20 

Trp Glu Leu Leu 
35 

Ala Thr Ala Ser 
50 

Gly His Ser Thr 
65 

Gly Thr Glu Glu 

Ala Glu lie Arg 
100 

Leu Tyr Ala Val 
115 

<210> 7 



Glu Leu Ala Gin 
5 

Pro Gly Ala Phe 

Gin Arg Ser Arg 
40 

Pro Ala Arg Asp 
55 

Val Val Asn Pro 
70 

Ala lie Val Tyr 
85 

Gin Gin lie Pro 

Glu Met Lys Lys 
120 



lie Tyr Ala Gin 
10 

Asn Leu Thr Thr 
25 

Ala Val Asp Asn 

Asp Lys Ala Ser 
60 

Trp Gly Glu Val 
75 

Ser Asp lie Asp 
90 

Val Phe Arg Gin 

105 

Pro 



Arg Gly Cys Gin 
15 

Gly Pro Ala His 
30 

Gin Val Tyr Val 
45 

Tyr Val Ala Trp 

Leu Ala Lys Ala 
80 

Leu Lys Lys Leu 
95 

Lys Arg Ser Asp 
110 
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<211> 507 
<212> DNA 
<213> Homo sapiens 



<400> 7 

atgtcatgga ggatttcccc tgccacacca tgctgtaggg agttaacttt tcatttgtgc 60 

attttctgtt tggaaacagc ttactgcaga gtgggtctgg gcatctgcta cgacatgcgg 120 

tttgcagagc ttgcacaaat ctacgcacag agaggctgcc agctgttggt atatccagga 180 

gcttttaatc tgaccactgg accagcccat tgggagttac ttcagcgaag ccgggctgtt 240 

gataatcagg tgtatgtggc cacagcctct cctgcccggg atgacaaagc ctcctatgtt 3 00 

gcctggggac acagcaccgt ggtgaaccct tggggggagg ttctagccaa agctggcaca 3 60 

gaagaagcaa tcgtgtattc agacatagac ctgaagaagc tggctgaaat acgccagcaa 420 

atccccgttt ttagacagaa gcgaaatatt ttcctgaata tgcagagaaa attcctggtg 480 

aatccacaca gaagctttct gaagtag 507 



<210> 8 
<211> 168 
<212> PRT 

<213> Homo sapiens 



<400> 8 

Met Ser Trp Arg 
1 

Phe His Leu Cys 
20 

Leu Gly lie Cys 
35 

Ala Gin Arg Gly 
50 

Thr Thr Gly Pro 
65 

Asp Asn Gin Val 

Ala Ser Tyr Val 
100 

Glu Val Leu Ala 
115 

lie Asp Leu Lys 
130 

Arg Gin Lys Arg 
145 

Asn Pro Hi s Arg 



lie Ser Pro Ala 
5 

lie Phe Cys Leu 

Tyr Asp Met Arg 
40 

Cys Gin Leu Leu 
55 

Ala His Trp Glu 
70 

Tyr Val Ala Thr 
85 

Ala Trp Gly His 

Lys Ala Gly Thr 
120 

Lys Leu Ala Glu 
135 

Asn lie Phe Leu 
150 

Ser Phe Leu Lys 
165 



Thr Pro Cys Cys 
10 

Glu Thr Ala Tyr 
25 

Phe Ala Glu Leu 

Val Tyr Pro Gly 
60 

Leu Leu Gin Arg 
75 

Ala Ser Pro Ala 
90 

Ser Thr Val Val 
105 

Glu Glu Ala lie 

lie Arg Gin Gin 
140 

Asn Met Gin Arg 
155 



Arg Glu Leu Thr 
15 

Cys Arg Val Gly 
30 

Ala Gin lie Tyr 
45 

Ala Phe Asn Leu 

Ser Arg Ala Val 
80 

Arg Asp Asp Lys 
95 

Asn Pro Trp Gly 
110 

Val Tyr Ser Asp 
125 

He Pro Val Phe 

Lys Phe Leu Val 
160 



<210> 9 

<211> 393 

<212> DNA 

<213> Homo sapiens 



<400> 9 

atgcggtttg cagagcttgc acaaatctac gcacagagag gctgccagct gttggtatat 60 

ccaggagctt ttaatctgac cactggacca gcccattggg agttacttca gcgaagccgg 120 

gctgttgata atcaggtgta tgtggccaca gcctctcctg cccgggatga caaagcctcc 180 

tatgttgcct ggggacacag caccgtggtg aacccttggg gggaggttct agccaaagct 240 

ggcacagaag aagcaatcgt gtattcagac atagacctga agaagctggc tgaaatacgc 3 00 

cagcaaatcc ccgtttttag acagaagcga aatattttcc tgaatatgca gagaaaattc 360 
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ctggtgaatc cacacagaag ctttctgaag tag 



393 



<210> 10 
<211> 130 
<212> PRT 

<213> Homo sapiens 



<400> 10 

Met Arg Phe Ala Glu Leu Ala Gin lie Tyr 

15 10 
Leu Leu Val Tyr Pro Gly Ala Phe Asn Leu 

20 25 
Trp Glu Leu Leu Gin Arg Ser Arg Ala Val 

35 40 
Ala Thr Ala Ser Pro Ala Arg Asp Asp Lys 

50 55 
Gly His Ser Thr Val Val Asn Pro Trp Gly 
65 70 

Gly Thr Glu Glu Ala lie Val Tyr Ser Asp 

85 90 
Ala Glu lie Arg Gin Gin lie Pro Val Phe 

100 105 
Phe Leu Asn Met Gin Arg Lys Phe Leu Val 
115 120 

Leu Lys 
130 



Ala Gin Arg Gly 
Thr Thr 
Asp Asn 



Ala Ser 
60 

Glu Val 
75 

lie Asp 
Arg Gin 
Asn Pro 



Gly Pro 

30 
Gin Val 
45 

Tyr Val 



Leu Ala 

Leu Lys 

Lys Arg 
110 
His Arg 
125 



Cys Gin 
15 

Ala His 

Tyr Val 

Ala Trp 

Lys Ala 
80 

Lys Leu 
95 

Asn lie 
Ser Phe 



<210> 11 

<211> 459 

<212> DNA 

<213> Homo sapiens 



<400> 11 

atgacctctt 

gtcactcgcg 

ttgccggaat 

attcctggtg 

attggaggct 

gggcctgatg 

cctggaaaaa 

tttgatactc 



tccgcttggc 
cttgtagc tt 
gctttaattc 
aatccacaca 
ctatccctga 
gaac tttac t 
ttacatttca 
gtatgtacca 



cctcatccag 
catccgggag 
tccatatgga 
gaagctttct 
agaggatgct 
agcaaagtat 
agaatctaaa 
gataagtttg 



cttcagattt 
gcagcaacgc 
gcgaaatatt 
gaagtagcaa 
gggaaattat 
agaaagatcc 
acattgagtc 
cctctttag 



cttccatcaa 
aaggagccaa 
ttcctgaata 
aggaatgcag 
ataacacctg 
atctgtttga 
cgggtgatag 



atcagataac 
aatagtttct 
tgcagagaaa 
catatatctc 
tgctgtgttt 
cattgatgtt 
tttctccaca 



60 
120 
180 
240 
300 
360 
420 
459 



<210> 12 
<211> 152 
<212> PRT 

<213> Homo sapiens 



<400> 12 

Met Thr Ser Phe Arg Leu Ala Leu lie Gin Leu Gin lie Ser Ser lie 

15 10 15 

Lys Ser Asp Asn Val Thr Arg Ala Cys Ser Phe lie Arg Glu Ala Ala 

20 25 30 

Thr Gin Gly Ala Lys lie Val Ser Leu Pro Glu Cys Phe Asn Ser Pro 

35 40 45 

Tyr Gly Ala Lys Tyr Phe Pro Glu Tyr Ala Glu Lys lie Pro Gly Glu 
50 55 60 
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Ser Thr Gin Lys 
65 

lie Gly Gly Ser 

Cys Ala Val Phe 
100 

lie His Leu Phe 
115 

Ser Lys Thr Leu 
130 

Met Tyr Gin lie 
145 



Leu Ser Glu Val 
70 

lie Pro Glu Glu 
85 

Gly Pro Asp Gly 

Asp lie Asp Val 
120 

Ser Pro Gly Asp 
135 

Ser Leu Pro Leu 
150 



Ala Lys Glu Cys 
75 

Asp Ala Gly Lys 
90 

Thr Leu Leu Ala 
105 

Pro Gly Lys lie 

Ser Phe Ser Thr 
140 



Ser lie Tyr Leu 
80 

Leu Tyr Asn Thr 
95 

Lys Tyr Arg Lys 
110 

Thr Phe Gin Glu 
125 

Phe Asp Thr Arg 



<210> 13 

<211> 858 

<212> DNA 

<213> Homo sapiens 



<400> 13 

atgacctctt tccgcttggc cctcatccag cttcagattt cttccatcaa atcagataac 60 

gtcactcgcg cttgtagctt catccgggag gcagcaacgc aaggagccaa aatagtttct 120 

ttgccggaat gctttaattc tccatatgga gcgaaatatt ttcctgaata tgcagagaaa 180 

attcctggtg aatccacaca gaagctttct gaagtagcaa aggaatgcag catatatctc 240 

attggaggct ctatccctga agaggatgct gggaaattat ataacacctg tgctgtgttt 300 

gggcctgatg gaactttact agcaaagtat agaaagatcc atctgtttga cattgatgtt 360 

cctggaaaaa ttacatttca agaatctaaa acattgagtc cgggtgatag tttctccaca 420 

tttgatactc cttactgcag agtgggtctg ggcatctgct acgacatgcg gtttgcagag 480 

cttgcacaaa tctacgcaca gagaggctgc cagctgttgg tatatccagg agcttttaat 540 

ctgaccactg gaccagccca ttgggagtta cttcagcgaa gccgggctgt tgataatcag 600 

gtgtatgtgg ccacagcctc tcctgcccgg gatgacaaag cctcctatgt tgcctgggga 660 

cacagcaccg tggtgaaccc ttggggggag gttctagcca aagctggcac agaagaagca 720 

atcgtgtatt cagacataga cctgaagaag ctggctgaaa tacgccagca aatccccgtt 780 

tttagacaga agcgaaatat tttcctgaat atgcagagaa aattcctggt gaatccacac 840 

agaagctttc tgaagtag 858 



<210> 14 

<211> 285 

<212> PRT 

<213> Homo sapiens 



<400> 14 
Met Thr Ser Phe 
1 

Lys Ser Asp Asn 
20 

Thr Gin Gly Ala 
35 

Tyr Gly Ala Lys 
50 

Ser Thr Gin Lys 
65 

lie Gly Gly Ser 

Cys Ala Val Phe 
100 

lie His Leu Phe 



Arg Leu Ala Leu 
5 

Val Thr Arg Ala 

Lys lie Val Ser 
40 

Tyr Phe Pro Glu 
55 

Leu Ser Glu Val 
70 

lie Pro Glu Glu 
85 

Gly Pro Asp Gly 
Asp lie Asp Val 



lie Gin Leu Gin 
10 

Cys Ser Phe lie 
25 

Leu Pro Glu Cys 

Tyr Ala Glu Lys 
60 

Ala Lys Glu Cys 
75 

Asp Ala Gly Lys 
90 

Thr Leu Leu Ala 
105 

Pro Gly Lys lie 



lie Ser Ser lie 
15 

Arg Glu Ala Ala 
30 

Phe Asn Ser Pro 
45 

lie Pro Gly Glu 

Ser lie Tyr Leu 
80 

Leu Tyr Asn Thr 
95 

Lys Tyr Arg Lys 
110 

Thr Phe Gin Glu 
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115 120 125 



Ser 


Lys 


Thr 


Leu 


Ser 


Pro 


Glv 


Asp 


Ser 


Phe 


Ser 


Thr 


Phe 


Asp 


1 ill. 


Pro 




130 










135 










140 










Tyr 


Cys 


Arg 


Val 


Gly 


Leu 


Gly 


He 


Cys 


Tvr 


Asp 


Met 


Arg 


Phe 


A "1 a 




145 










150 










155 










S. O \J 


Leu 


Ala 


Gin 


He 


Tyr 


Ala 


Gin 


Arg 


Glv 


Cys 


Gin 


Leu 


Leu 


Val 




Pro 

XT i U 










165 










170 










175 




Gly 


Ala 


Phe 


Asn 


Leu 


Thr 


Thr 


Gly 


Pro 


Ala 


His 


Trn 

J. J. ±J 


Glu 


Leu 












180 










185 










190 






Arg 


Ser 


Arg 


Ala 


Val 


Asp 


Asn 


Gin 


Val 


Tvr 


Val 


Ala 


Thr 


Ala 


Ser 


Pro 






195 










200 










205 








Ala 


Arg 


Asp 


Asp 


Lys 


Ala 


Ser 


Tvr 


Val 


Ala 




Gly 


His 


Ser 


± L IS- 


val 




210 










215 










220 










Val 


Asn 


Pro 


Trp 


Gly 


Glu 


Val 


Leu 


Ala 


Lys 


AT ^ 




1 LIS. 


pi n 

kaJLU 


C' 1 n 


A±a 


225 










230 










235 










240 


He 


Val 


Tyr 


Ser 


Asp 


He 


Asp 


Leu 


Lys 


Lys 


Leu 


Ala 


Glu 


He 


Arg 


Gin 










245 










250 










255 




Gin 


He 


Pro 


Val 


Phe 


Arg 


Gin 


Lys 


Arg 


Asn 


He 


Phe 


Leu 


Asn 


Met 


Gin 








260 










265 










270 






Arg 


Lys 


Phe 


Leu 


Val 


Asn 


Pro 


His 


Arg 


Ser 


Phe 


Leu 


Lys 









275 280 285 



<210> 15 

<211> 3093 

<212> DNA 

<213> Homo sapiens 

<400> 15 

ggatggtggg gcatacctgt ggtcccagct acataagagg ctgagacaag aggattgcct 60 

gaactgagta ggtcaaggct gcagtggacc atgtttgtgc cactgcactc cagcctgggc 120 

gacagaacaa ggccctgcct caaaataaaa aatattagct aatggaaagt gattatcata 180 

aaagctaaaa gggaacttta aagaacagaa gaaaagcaaa tatgatgtat agctactacc 240 

tccaggaaga aataagcttg gaagagcccc caacctcctt gctccagggc tgagcacaga 300 

ccttgtcagg gctggctaca taatttgtgg ggcccagttc ccttgttcag atagcaagag 360 

aaaagtgctg ttagcttttc cttctgcagt atctctttca acctctcatg gtgttatttg 420 

ctgtttaatg tcatgttctc ttggacacat gaatacttat ggggtaagtg cagactttta 480 

gaggtgcctg ggacccctgt cctgtgaata ggcatgtgtg cagctcactg gctgccaggt 540 

tttccctctg ccagcagcgg gatcgatgtg ctgtgaccca gccagtagtg gggaaactga 600 

gacagacatc ttcccttccc atgagctggg cctgctcatg ggaattatgt gagcagcttc 660 

caaggaatca cactttctgt gctgggacat actcaagtat atggattgga ggtagacgag 72 0 

aggcccattg aacaaacagt aagggacagg accatattca aacccagtct ttttacttta 780 

agccatattc ctcatttcat tcccctacac tgcgtagtaa gaagctggtt cactctagat 840 

tcttgtgcct ggcatgggac tttgcccatg gatattgctc tatctccaga tagattttag 900 

actattgaca ttttggacag gataattctt cgttgtgtta tggagggggt tgtcctatgc 960 

attgtaggat gtttggcagt atccttggtc tctattcatt agatgccact catacctcat 1020 

cagttgtggc atcaaaggta tcttcagaca ttgtcagatg tccccccggg gacataactg 1080 

ccttccattt gagaactatg gctctgtctg aatccagcag ttcgatcttc tgatagctgt 1140 

tttcttttgt ctttgttctc agcccccccc cccccggtag gacccgcggt ccgccggatc 1200 

tccagcgctc agtccgcgcc gcaggtggtg cttgtctgca gagtcatgac ctctttccgc 1260 

ttggccctca tccagcttca gatttcttcc atcaaatcag ataacgtcac tcgcgcttgt 1320 

agcttcatcc gggaggcagc aacgcaagga gccaaaatag tttctttgcc ggaatgcttt 1380 

aattctccat atggagcgaa atattttcct gaatatgcag agaaaattcc tggtgaatcc 1440 

acacagaagc tttctgaagt agcaaaggaa tgcagcatat atctcattgg aggctctatc 1500 

cctgaagagg atgctgggaa attatataac acctgtgctg tgtttgggcc tgatggaact 1560 

ttactagcaa agtatagaaa gatccatctg tttgacattg atgttcctgg aaaaattaca 1620 

tttcaagaat ctaaaacatt gagtccgggt gatagtttct ccacatttga tactcgtatg 1680 
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taccagataa gtttgcctct ttagcaatct cagtagaaga caatcaggta tttatttctt 1740 

ttttgtctct ctccgatttc ttcacataac ctaactgaaa gaccataagt gagaaaggca 1800 

gagaatcatc acagatctgg aaagttcggg cttatttgag aactaaggat ttgacacgat 1860 

tttgcccttt gatttgattg tagcttcctg ttacggcttc cagagtatac ctattaggct 1920 

acagttgagt acctcccatc tagataataa gcattcaatt agaatgaatt tctcatcttt 1980 

actccgctga tgtaaatgat gtctttatga gatgaagtcc aagtaggaat gagcttgtaa 2040 

attatctctg tcctcaggtc ctgtgttaat ttatccctgt cagtgttttg tgatcattat 2100 

gtcatggagg atttcccctg ccacaccatg ctgtagggag ttaacttttc atttgtgcat 2160 

tttctgtttg gaaacagctt actgcagagt gggtctgggc atctgctacg acatgcggtt 2220 

tgcagagctt gcacaaatct acgcacagag aggctgccag ctgttggtat atccaggagc 2280 

ttttaatctg accactggac cagcccattg ggagttactt cagcgaagcc gggctgttga 2340 

taatcaggtg tatgtggcca cagcctctcc tgcccgggat gacaaagcct cctatgttgc 2400 

ctggggacac agcaccgtgg tgaacccttg gggggaggtt ctagccaaag ctggcacaga 2460 

agaagcaatc gtgtattcag acatagacct gaagaagctg gctgaaatac gccagcaaat 2520 

ccccgttttt agacagaagc gatcagacct ctatgctgtg gagatgaaaa agccctaaag 2580 

tttatgtttc taatgtgtca cagaatagga cgatatgatt ctacaacata atcaactccc 2640 

tattaaattc tttaatgaag aaaaaaaaaa aaaaaaaaaa aaaaaatatt ttcctgaata 2700 

tgcagagaaa attcctggtg aatccacaca gaagctttct gaagtagcaa aggaatgcag 2760 

catatatctc attggaggct ctatccctga agaggatgct gggaaattat ataacacctg 2820 

tgctgtgttt gggcctgatg gaactttact agcaaagtat agaaagatcc atctgtttga 2880 

cattgatgtt cctggaaaaa ttacatttca agaatctaaa acattgagtc cgggtgatag 2940 

tttctccaca tttgatactc cttactgcag agtgggtctg ggcatctgct acgacatgcg 3000 

gtttgcagag cttgcacaaa tctacgcaca gagaggctgc cagctgttgg tatatccagg 3060 

agcttttaat ctgaccactg gaccagccca ttg 3093 
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